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Abstract 

Recently there has been much interest in Gowers uniformity norms from the perspective of 
theoretical computer science. This is mainly due to the fact that these norms provide a method 
for testing whether the maximum correlation of a function / : F™ — > F p with polynomials of 
degree at most d < p is non-negligible, while making only a constant number of queries to the 
function. This is an instance of correlation testing. In this framework, a fixed test is applied 
to a function, and the acceptance probability of the test is dependent on the correlation of the 
function from the property. This is an analog of proximity oblivious testing, a notion coined by 
Goldreich and Ron, in the high error regime. 

In this work, we study general properties which are affine invariant and which are correlation 
testable using a constant number of queries. We show that any such property (as long as the 
field size is not too small) can in fact be tested by Gowers uniformity tests, and hence having 
correlation with the property is equivalent to having correlation with degree d polynomials for 
some fixed d. We stress that our result holds also for non-linear properties which are affine 
invariant. This completely classifies affine invariant properties which are correlation testable. 

The proof is based on higher-order Fourier analysis. Another ingredient is a nontrivial 
extension of a graph theoretical theorem of Erdos, Lovasz and Spencer to the context of additive 
number theory. 
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1 Introduction 



Blum, Luby, and Rubinfeld [2] made a beautiful observation that given a function / : F™ — > ¥ p , 
it is possible to inquire the value of / on a few random points, and accordingly probabilistically 
distinguish between the case that / is a linear function and the case that / has to be modified 
on at least e > fraction of points to become a linear function. Inspired by this observation, 
Rubinfeld and Sudan [14] defined the concept of property testing which is now a major area of 
research in theoretical computer science. Roughly speaking to test a function for a property means 
to examine the value of the function on a few random points, and accordingly (probabilistically) 
distinguish between the case that the function has the property and the case that it is not too 
close to any function with that property. Interestingly and to some extent surprisingly these tests 
exist for various basic properties. The first substantial investigation of property testing occurred 
in Goldreich, Goldwasser, and Ron [3] who showed that several natural combinatorial properties 
are testable. Since then there has been a significant amount of research on classifying the testable 
properties in combinatorial and algebraic settings. 

The theory of uniformity, initiated by the proof of Szemeredi's theorem [15] . plays an important 
role in the area of property testing. In [6] Gowers in a Fourier-analytic proof for Szemeredi's theorem 
introduced a new notion of uniformity, defined through Gowers uniformity norms. This notion has 
an interesting implication in the context of property testing. The Gowers norm of a function can 
be expressed as an average of values of this function on a few random sample points. There are 
theorems which show that a bounded function has non-negligible Gowers uniformity norm if and 
only if it has a non-negligible correlation with a low degree polynomial. These two facts show 
that having correlation with low degree polynomials is testable. Our main result is that, roughly 
speaking, the only correlation testable families of functions are the ones that can be tested by 
Gowers uniformity norms. 

The tests that we study in this article are slightly different in nature from the typical statements 
in the area of property testing. Typically in property testing the goal is to distinguish the functions 
that are in a set T> (structured) from the functions that are in a non-negligible distance from every 
element in T>. For example linearity testing [2], probably the most celebrated result in this area, 
says that it is possible to test whether a function / : FJ> — > F2 is linear or that it has to be modified 
on at least a non-negligible fraction of all points in order to become a linear function. In this article, 
we are interested in a different kind of tests. Here we seek a weaker structure in the function, and 
having correlation with an element in T> replaces the usual condition of actually being in T>. This 
is in the spirit of proximity oblivious testing, a notion introduced by Goldreich and Ron [5] in the 
context of graph properties. 

1.1 Notations 

For a natural number k, denote [A;] := {1, . . . , k}. The complex unit disk is denoted by B = {z € 
C : 1 2! I < 1}. We will usually use the lower English letters x,y,z to denote the elements of F™. For 
x € Fp, and i G [n], x(i) denotes the i-th coordinate of x, i.e. x = (x(l), . . . , x(n)). We frequently 
need to work with the elements of (F™) fc which we regard as vectors with k coordinates. These 
elements are denoted with bold font e.g. x = (xi, . . . , x^) € (F™) fc . For 1 < i < n, &i denotes the 
i-th standard vector in Wl. Capital letters X, Y, etc are used to denote random variables. 

— m 

For an element m € F p , we use the notation e p (m) := e f . We denote by /, g, . . . functions 
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from F™ to C, and by f, g, . . . functions from F™ to F p . We denote / := e p (f) if f{x) = e p (f(x)). We 

denote n-variate polynomials over F™ by P, Q, 

The bias of a function / : F™ — >• C is defined to be the quantity 



bias(/) := E™[/(X)] 



(1) 



The bias of a function f : F p — > F p is defined to be bias(f) := bias(e p (f)). The inner product of two 
functions /, g : F™ — >■ C is defined as 



(f,g) :=¥, Xe¥ n[f(X)g(X)}. 



The correlation of a function / : ¥ r 



C with a set D of functions from F™ to C is defined as 



u{D) 



sup |(/, • 

geD 



(2) 
(3) 



By an abuse of notation, if D is a set of functions from F p to F p , we define 

lu(-D) : = 



sup |(/,e p (g))| 



Note that || • is always a semi-norm. 

For two vector spaces V and W over F p , let Lin(V, W) denote the set of all linear transformations 
from V to W. Let Aff(n,F p ) denote the group of all invertible affine transformations from F™ to 
itself. For a function / : F" — > C, and an A € Aff(ra,F p ), we denote by Af the function that maps 
x to /(.Ax). 



1.2 Gowers Uniformity Norms 

Gowers uniformity norms are defined in the more general setting of arbitrary finite Abelian groups, 
but in this article we are only interested in the case where the group is F™. 

Definition 1.1 (Gowers uniformity norms). Let G be a finite Abelian group and f : G — > C. For 

an integer k>l, the k-th Gowers norm of f, denoted \\f\\jjk is defined by 



2 k 



E 



II c^ffx + ^y] 

SC[k] V ieS / 



(4) 



where C denotes the complex conjugation operator, and X, Y\ , . . . , 1^ are independent random vari- 
ables taking values in G uniformly at random. 



These norms were first defined in [6] in the case where G is the group Z^r. Note that H/Ht/i 
|E[/(X)]|, and thus || • is a semi-norm rather than a norm. The facts that the right-hand side 
of flU) is always non- negative, and that for k > 1, || • \\jjk is actually a norm are easy to prove, but 
not trivial (see [6] for a proof). 

Let us explain the relevance of the uniformity norms to the area of property testing. The goal 
in property testing is to obtain certain information about a function by "reading" its values only 
on a small number of points. The simplest case is that of testing correlation with linear functions. 
For a G F p , let i a : F™ — >• F p be the corresponding linear function, defined as i a {x) = Y17=i o-(i)x(i). 
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Let Linear = {£ a : a € F™} be the set of linear functions. Let f : F™ — > ¥ p be a function, and let 
/ = e p (f). The correlation of f with linear functions is given by 

||/|U(Lincar) = „ max \(f,e p (t a ))\ = maxbias(f - £ a ), 

which is the same as the maximum of the absolute values of the Fourier coefficient of /. It is 
known 0E] that the correlation of f with linear functions is related to the U 2 norm of /. Specifically, 
for every e > 0, 

• Direct Theorem: If ||/||u(Lmear) — e ) 

then ||/||[/2 > e. 

• Inverse Theorem: If ||/ > e, then || y 1 1 ix(Linear) — e • 

These two facts together show that ||/||{/2 gives a rough estimate for the maximum correlation of 
f with linear functions. Recall that 

WfWm = h P {t)tm = ne P (i(x + Y + Z) - f(X + Y) - f(X + Z) + f(X))}, 

where X, Y, Z € F™ are uniformly chosen. In other words, the joint distribution of l(X + Y + 
Z),£(X + Y),f(X + Z),f(X) allows to distinguish between the case that f has correlation at least 
e with linear functions, and the case where f has correlation at most 5 = e 2 /2 (say) with linear 
functions. Hence, correlation with linear functions is "testable with just 4 queries to f" (for every 
e > 0). 

Similar to the case of linear functions, correlation with degree d polynomials can be tested by 
the U d+1 uniformity norm. The main ingredient is the inverse theorem for F^, recently proved 
in [HE]. Let Poly rf (F") be the set of polynomials of degree at most d over F™. These results show 
that, as long as p > d (i.e. the field is not too small 0), for every e > 0, 

• Direct Theorem: If ||/|U(p iy d (F«)) > e, then H/H^+i > e. 

• Inverse Theorem: If H/Hyd+i > e, then ||/|| u (p iy d (F™)) — ${ e ) > 0- 

The exact dependency of 5(e) on e is currently unknown, but crucially 5 does not depend 
on n. Analogously to the case of linear functions and U 2 , we get that the joint distribution of 
(i(X + J2iei Yi) : I Q [d + 1]) where X, Yi, . . . , Yd+i £ FL 1 are uniformly chosen, distinguishes the 
case where f has noticeable correlation (> e) with degree d polynomials, from the case where f has 
negligible correlation (< 5(e) /2) with degree d polynomials, for every e > 0. Hence, correlation 
with polynomials of total degree d is testable with just 2 d+l queries to f (for every e > 0). 

An equivalent qualitative formulation of the direct and inverse theorems stated above is as 
follows. Let (f n : F™ — > ¥ p ) n( z^ be a sequence of functions. Then as long as d < p we have that 

lim ||e p (f n )|| u (p ol (F „ )) = 0«==> lim 1^(^)11^+1=0. 

It will be convenient to express our results in such terms. 

1 Recently, the inverse theorem for the case p < d was proved in |16| . In this range the role of polynomials is 
replaced by non- classical polynomials. As our techniques are limited to the case of p > d, we do not expand on this. 
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1.3 Correlation testable properties 

The above discussion motivates the general definition of correlation testable properties. Let T> = 
{D n }n£N be a family of sets, where each D n is a set of functions from ¥ p to ¥ p . Informally, T> 
is correlation testable using q queries if there exists a distribution over x±,. . . ,x q G F", and for 
every e > there exists 5(e) G (0,e), such that the following holds. The joint distribution of 
(f(xi), . . . , f(x q )) allows to distinguish between the case that f has noticeable correlation (> e) with 
T> and the case that f has negligible correlation (< 6(e)) with T>. 

Definition 1.2 (Correlation testable properties). A family V = (D n ) is correlation testable with q 
queries, if there exists a distribution /j, taking values in (¥ p ) q and a mapping T : ¥ p — > {0, 1} ; such 
that the following holds. For every e > 0, there exist 5 G (0,e), < 6~ < + < 1 and uq G N, such 
that for every n > uq and f : F™ — > ¥ p we have: 

• If IMf)|L(D n ) > e then Pr (Xl ,... iXq) ^[r(f(X 1 ), . . . , f(X ? )) = 1}>6+. 

• // ||ep(f)||„(u n) < 5 then Pr (Xli ... iXg) ^[r(f(X 1 ), . . . ,f(X q )) = 1]<0~. 

Following the discussion on Gowers uniformity norms, Poly d = {Poly d (F")} n6 N is correlation 
testable using q = 

2 d+i 

queries, as long as d < p. 
Our goal is to study families V = {-D n }neN which are correlation testable using a constant 
number of queries. We shall require the sets D n to be consistent with each other: 

• Al: (Consistency) For positive integers m > n and g G D n , the function h : F™ —¥ ¥ p 
defined as h(xi, . . . , x m ) = g(xi, . . . , x n ) belongs to D m . 

We need to make also a more crucial assumption. In this general setting, the algebraic structure 
of Fp is ignored, and we are treating ¥ p as a generic set of size p n . In order to take the algebraic 
structure of F™ into account, we shall require D n to be affine invariant: 

• A2: (Affine invariance) For every positive integer n, if g G D n , then for every A G 
Aff(n,F p ), we have Ag G D n . 

Invariance plays a crucial role in the area of property testing. We refer to the work of Kaufman 
and Sudan [13] for the role of invariance in algebraic property testing. We stress that we do not 
require V to be linear; i.e. we do not require that if f , g G D n then also f + g G D n . 

The last condition relates to the size of D n . We study families where one can distinguish 
functions with noticeable correlation from functions with negligible correlation. In order not to 
make this meaningless, we would like to have functions with negligible correlation. For example, 
we would like a random function not to have correlation with V with high probability. Fix a 
function g G D n . The number of functions f : F^ — > ¥ p which have correlation at least 5 with 
g is p c ^'P" where lim^o c(<5) = 1. Thus, a sufficient condition for random functions not to be 
correlated with D n is that the size of D n is p°( p ™\ We thus add the following requirement. 

• A3: (Sparsity) For every e > and large enough n, we have \D n \ < p £pn . 

We call every T> = {Z) n } ng N satisfying assumptions Al, A2, A3 a proper dual. 

Our main result is the following theorem which roughly speaking says that the only correla- 
tion testable families of functions are the ones that can be tested by Gowers uniformity norms 
provided that the field size p is not too small. Say a sequence of functions (f n ) ng pj is unbiased if 
Hindoo bias(f n ) = 0. 
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Theorem 1.3 (Main theorem). Consider a proper dual T> = {D n } n( zfq. IfT>is correlation testable 
with q queries, and if p > q — 1, then there exists 1 < k < q — 1 such that the following holds. For 
every unbiased sequence of functions (f n : F™ — > ¥ p ) n( zfq, we have 

lim ||e p (f n )|| u (D„) = lim ||e„(f ri )|| c/ A i +i = 0. 

Equivalently, Theorem 11.31 can be phrased as follows: the sequence (f n )neN has correlation with T> 
if and only if it has correlation with polynomials of degree at most k. 

We remark on the seemingly odd requirement that f n is unbiased. Let /i be some distribution 
over F p , and let f n : F™ — > F p be a random function, where each f n (x) is sampled independently 
according to fi. By condition A3, since \D n \ = p°(P™\ we have that with probability 1 — o n (l), 

lim ||ep(fr»)|| u (z?„) = 0. 

However, if \x has a nonzero bias, then almost surely f n will have correlation with constant functions. 
It turns out that ruling out sequences of functions which have correlation with constant functions 
is sufficient for establishing Theorem 11.31 



Paper organization We give a short overview of the proof of our results in Section [2j We 
discuss systems of linear forms in Section [3j We survey higher-order Fourier analysis in Section [U 
We prove the main result for correlation testable families in Sectional We prove the extension of 
Erdos-Lovasz-Spencer theorem in Section [71 We give some concluding remarks and pose some open 
problems in Section [U 



2 Proof overview 



In this section we give an overview of the proofs of our main results. We skip most of the techni- 
calities, and try to emphasis the essence of the proofs. However if the reader wishes, he/she can 
skip this section and start reading the formal definitions and proofs starting at Section [3l 

The proof of Theorem ll.3l is based on studying averages of functions evaluated on linear forms. 



2.1 Linear forms 

A linear form in k variables is a vector L = (Ai, . . . , Xk) € F^ regarded as a linear function from V k 
to V, for every vector space V over F p : If x = (xi, ■ ■ ■ , x^) £ V k , then L(x) := AiXi + . . . + \k%k- 
A system of linear forms in k variables is a finite set C = {L\, . . . , L m } of distinct linear forms Li 
in k variables. For a function / : F™ — > C, and a system of linear forms C = {L±, . . . ,L m } in k 
variables, define the average 



tc(f) :=E 



n/(^( x )) 



i=l 



(5) 



where X is a random variable taking values uniformly in (F™) . We define two generalizations of 
such averages. First, when / : F" — >• C, one may take conjugations. For a € {0, l} m , define 



tcAf) ==E 



Y[C a ^f(L t (X)) 



i=l 



(6) 
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where C is the conjugation operator. Second, if f : F™ — > ¥ p , one may take coefficients in ¥ p . For 
/3 G F™, define 



ted*) ==E 



vi=l 



(7) 



We note that for functions f : FIJ — > F p , (|7|) is more general than ([6]). Indeed, if a G {0, l} m , then 
setting /3(i) = (-1)«W gives i Aa (e p (f)) = ^(f). 

Let P be a proper dual family which is correlation testable. Then since T> is afhne invariant, 
it is easy to show that correlation with T> can be expressed in terms of averages t*£g. We show 
in Lemma 16.21 that for every e > there exist 5 G (0, e), systems of linear forms £i,...,Cg and 
corresponding coefficients f3\, . . . ,/3t, and no G N, such that the closures of the following two sets 
are disjoint: 



Te := m iA (f), ■ ■ ■ ,t% eA m n > n oJ : K -+ Fp, ||e p (f)|| u(Z)n) > e}, 



and 



S £ :=m iA (f),...,fc e ,p e (f))\n>no,f-.F; 



u(D n ) < $}■ 



Moreover, each of the systems Ci, ■ ■ ■ has at most q linear forms. 

Recall that D is the unit disk in C. It turns out to be easier to first analyze sets with a stronger 
requirement which holds for all functions / : F™ — > B, and not just for functions of the form 
/ = e p (f) for f : F™ — > ¥ p . Say a family T> is strongly correlation testable if the closures of the 
following two sets are disjoint: 



and 



T £ ■= {(t Cl Mf), ■ ■ ■ ,tc t ,<u(J))\n > "o, / : F£ 
S £ := {(tc uai (f), • • • ,t Cl , at (f))\n > n J : F™ 



u (D n ) > e}, 

u{D„) < 



2.2 Approximation by averages over polynomial factors 

We turn to analyze averages of the form t£ jCe (f), and more generally averages of the form 
E \JXJLiC a{i) fi(Li(X))) where X G (¥%) k is uniform. A crucial ingredient is that we can "approx- 
imate" fi by nice functions, and that these approximations are essentially undetected by averages 
of the above form. These nice functions will be averages of f\, . . . , f m on polynomial factors. 

A polynomial factor B is a partition (sigma-algebra) of F^ defined by a collection of polynomials 
Pi, ... , Pc- The atoms of the partitions are 

{x G F™ : Pi(x) = o(l), . . . , P C (x) = a(C)}, 

where a(l), . . . , a(C) G ¥ p . The degree of B is the maximal degree of Pi, ... , Pc, and the complexity 
of B is C. 

In Definition 14.61 to every set of polynomials V we will assign a natural number rank("P). The 
rank of a single polynomial P is defined to be rank({P}), and rank of a polynomial factor B is the 
rank of the set of polynomials defining it. 

The average of a function / : F™ — >• D over a polynomial factor B, denoted F,(f\B) : F™ — > B, is 
defined as 

E(f\B)(x) = E{ye^:P 1 (y)=Px(x),„„P c (y)=P c (x)}[fiy)}- 
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That is, E(/|B)(x) is the average of / in the atom to which x belongs. The usefulness of these 
averages is that they may be used to approximate the function /. It is known (see Theorem 14, 8p 
that given any d < p and 5 > 0, for every function / : F™ — > D, there exists a polynomial factor B 
of degree d and bounded complexity such that 

\\f-E(f\B)\\ ud+1 <S. 

As we shall see, this allows us to replace any arbitrary function / : F™ — > D with a structured 
function K(f\B) without changing averages t£ jCt significantly. Note that E(/|B) is immeasurable, 
that is, it is constant on the atoms of B. Thus, it will suffice to study averages tc a (g) where g is 
a ,6-measurable function. 

As we shall see, we may assume that all the systems of linear forms arising in our proofs are 
homogenous. Informally, a system of linear forms is homogeneous if (maybe after some change of 
basis) we have Li(xi, . . . , Xj~) = L'^xi, . . . , a^-i) + xj.. One of the key ingredients in the proof of 
our main result is the following invariance theorem which is established in 

Informal Statement (Proposition |4.9j) . Let V = {P\, . . . , Pc}, Q = {Qi,---,Qc} be two col- 
lections of polynomials over F™ of degree at most d < p such that deg(Pj) = deg(Qj) for every 
1 < i < C. Let C = {L\, . . . , L m } be a homogeneous system of linear forms, and T : Fp — > D be an 
arbitrary function. Define f, g : F™ — > D by 

f(x)=T(P 1 (x),...,P c (x)) 

and 

g(x)=T(Q 1 (x),...,Q c (x)). 
Then for every e > 0, if rank('P), rank(Q) are large enough, then for all a € {0, l} m , 

\tcAf)-tcAa)\<e. 

2.3 Interior of a set of averages of linear forms 

Another component in the proof of Theorem 11.31 is an extension of a result of Erdos, Lovasz, and 
Spencer [3] to the setting of additive combinatorics. We show (Theorem 17. ip that if C\, . . . , Ci are 
systems of linear forms, then unless there are some trivial obstructions, there exists N € N such 
that the set 

{(^(/),...,i A (/)):/:F^^[0,l]}c^ 

has a nonempty interior. Formally, for the theorem to hold, we require the systems of linear forms 
to be non-isomorphic and connected. 

Two systems of linear forms C = {L^, . . . , L' m } and C" = {L'(, . . . , L'^} are isomorphic, if the 
distributions (L^(X), . . . , L' m (K)) an d (L"(X), . . . , L^(X)) are identical (after rearranging them if 
necessary). Note that if £.',£." are isomorphic then 

tc>(f) = t c »(f), 

for all functions / : F™ -)■ B. 

A system of linear forms C = {L\ , . . . , L m } is connected if it cannot be partitioned as C = £iU£2 
where span(£i) n span(£2) = {0}. Note that if £ is not connected, then 

tc(f)=t Cl (f)tc 2 (f), 

for all functions / : F" — > B. 



9 



2.4 Proof for strongly correlation testable families 

Let T> = {-D n } n£ N be a family which is strongly correlation testable. Assume for simplicity of 
exposition in the proof overview that the following holdall. There exists a system of linear forms 
C\, . . . ,Ci such that for every e > 0, there exists 5 £ (0, e) and no £ N, such that the following two 
sets are disjoint: 

T e := {(t Cl (f), • • • , t Cl {f))\n > no,/ : F£ -»• B, ||/|| u(Dn) > e}, 

and 

S £ := {(t^C/), • • • ,*£*(/))|n > n , / : F£ ^ B, \\f\\ u(Dn) < 6}. 

We can furthermore assume that these systems are homogeneous, and there is a number s < p such 
that for all i £ [£], 

\tc t (f)-tcM\^0, 

if /, g : F™ — > B satisfy ||/ — <7||i/*+i ~ 0. This follows from the assumption that the number of 
queries is q < p. 

Let t £ N be maximal such the following holds. There exist polynomials Q n of degree exactly t 
such that 

• Q n has noticeable correlation with D n : that is liminf n __ i . 00 ||e p (Q n )|| u (£> n ) > 0. 

• Q n has "large enough rank" (exact definition is deferred to the actual proof). 

We can show that t < s: if Q n are polynomials of degree greater than s and large enough rank, then 
|| e p(Qn)||j7 s + 1 ^ an d hence i£ i (e p (Qn)) ~ 0, for all i £ [£]. Thus we must have ||e p (Q n )|| u (£) n ) ~ 0. 

Let f n : F™ — > B be a sequence of functions. We establish the main result (for strongly 
correlation testable families) by showing that: 

lim \\fn\\u(D n ) = <^ hm \\f n \\uw = 0. 

Proof of <^=: Assume by contradiction that \iva. n ^oo \\fn\\u t+1 = but that e : = 
liminfn^oo ||/n||«(D„) > 0. Let e' > be the distance between T e and S e (here we use the 
fact that their closures are disjoint, hence there is positive distance between the sets). Let B n be 
a polynomial factor of degree s such that \\f n — E(/ ra |£> n )||;y s +i < rj. Choosing r/ > small enough 
we can guarantee that \tci(f n ) ~ *£i(F(/n|B n ))| < e'/2, hence we must have \\E(f n \B n )\\ u ( Dn ) > 5. 

Assume B n is defined by polynomials P n ,i> • • • ■> Pn,C, which we can assume to be of high enough 
rank. For 7 £ F^ , define P n , 7 (x) := El(i)Pn,i( x )- We have E(/„|B n )(x) = ^ 7 
where |c 7 | < 1. Using the assumption that lmin^oo ||/ n ||{7t+i = 0, we show that if deg(P 7 ) < t, 
then its contribution to the sum is negligible, i.e. c 7 can be assumed to be arbitrarily small. Thus, 
there must exist a polynomial P nj7 of degree at least t + 1 such that ||ep(P n)7 )|| w f£) n ) > 5p~ c . As 
this polynomial has "large enough rank", this contradicts the maximality of t. 

2 In the actual proof we have generalized averages i£ i)Q;< (/). 
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Proof of =>: Assume that e := linin^oo ||/n||[/' t + 1 > (actually we have liminf, which we can 
replace by an actual limit by Condition Al). Thus, there exist polynomials P n of degree at 
most t such that |(/ n! e p (-P n ))| > £■ We can assume these polynomials to have "large enough 
rank". Assume first that these polynomials are of degree exactly t. By the definition of t, there 
exist polynomials Q n of degree t and "large enough rank" such that ||e p ((5 n )|| u m n ) > e'. We use 
the invariance theorem (Proposition I4.9P to construct a new sequence of functions f' n : ¥ p — > B 
(essentially replacing P n with Q n ) such that 

• f n has correlation with Q n - In fact, this correlation is strong enough to guarantee that 
ll/nlU(r> n ) — e " f° r some £ " > independent of n. 

• Averages tc- cannot distinguish /„ from f' n . So, we must also have ||/n|| u (_D n ) > 8" > 0. 

The case where P n have degrees less than t is reduced to the case of degree t. We use the theorem on 
the interior of a set of averages of linear forms to argue that we can (essentially) tweak f n slightly, 
without changing averages t/v by much, but such that f n will have correlation with polynomials of 
degree exactly t and large enough rank. 



2.5 Proof for correlation testable families 

The proof for the correlation testable families (i.e. with guarantees only for functions f : ¥ p — > ¥ p ) 
follows similar steps, albeit slightly more involved. Let P(¥ p ) C W denote the convex set of 
probability distributions on ¥ p . Then correlation testable families can in fact test randomized 
functions as well, i.e. functions f : F™ — > P(¥ p ). Once this is established the remainder of the proof 
follows identical lines to the case of strongly correlation testable families, where we employ the fact 
that -P(Fp) is a convex set to appropriately define averages such as E(f|£>). 



3 Systems of linear forms 



A linear form in k variables is a vector L = (Ai, . . . , A&) 6 F^ regarded as a linear function from V k 
to V, for every vector space V over F p : If x = (x\, . . . , Xk) € V k , then L(x) := AiXi + . . . + XkXk- 
A system of m linear forms in k variables is a finite set £ = {Li, . . . , L m } of distinct linear forms, 
each in k variables. For a function / : ¥ p — > C, and a system of linear forms C = {L\, . . . , L m } in 
k variables, define 

m 

n/(^( x )) 



tdf) ■= F 



i=l 



(8) 



where X is a random variable taking values uniformly in (F™) . More generally, we consider 
two types of generalized averages. First, when / : ¥ p — » C takes complex values, we allow also 
conjugation. Let C denote the conjugation operator C(z) = z. Then for a S {0, l} m define 



tcAf) ==E 



Y[C a ^f(L t (X)) 



Note that t£,o m (-) = tc(')- Second, when / := e p (f) where f : F^ 
For P € ¥ p n , define 



(9) 



F„, we also allow coefficients. 



ep £^(i)f(L,(X)) 



(10) 



11 



We note that for functions f : F" — > ¥ p , the latter average is more general than the first one: for 
every a £ {0, l} m , let /3 e {-1, l} m be set as /3(i) = (-l) a W; then ^(f) = i£, a (e p (f)). 

Note that if A C F™ and 1a '■ F™ — > {0, 1} is the indicator function for A, then ^(Ia) is the 
probability that Li(X), . . . , L m (X) all fall in A. Roughly speaking, we say A C F™ is pseudorandom 
with regards to £, if 



that is if the probability that all Li(X), . . . ,L m (X) fall in A is close to what we would expect if 
A was a random subset of F™ of size |^4|. Let a = \A\/p n be the relative measure of A, and define 
f(x) := Ia(^) — ol. We have 

i£(U) = t c (a + f) = a m + Yl am ~ lSl ■ hwesyif). 

SC[m],SJ$ 

So, a sufficient condition for A to be pseudorandom with regards to C is that tr L .. ie s\(f) rj for 
all nonempty subsets S 1 C [m]. Green and Tao [10] showed that a sufficient condition for this to 
occur is that ||/||{7«+i is small enough, where s is the Cauchy-Schwarz complexity of the system of 
linear forms. 

Definition 3.1 (Cauchy-Schwarz complexity). Let C = {Li, . . . , L m } be a system of linear forms. 
The Cauchy-Schwarz complexity of C is the minimal s such that the following holds. For every 
1 < i < m, we can partition {L^j^y^^ into s + 1 subsets, such that Lj does not belong to the 
linear span of each such subset. 

The reason for the term Cauchy-Schwarz complexity is the following lemma due to Green and 
Tao |10] whose proof is based on a clever iterative application of the Cauchy-Schwarz inequality. 

Lemma 3.2 (|10|). Let fx, . . . , f m : F p — > B. Let C = {L\, . . . ,L m } be a system of m linear forms 
in k variables of Cauchy-Schwarz complexity s. Then 



^xe(F£)* 



n^ x )) 



< min WfiWus+i. 

Ki<m 



Note that the Cauchy-Schwarz complexity of any system of m linear forms in which any two 
linear forms are linearly independent (i.e. one is not a multiple of the other) is at most m — 2, since 
we can always partition {-£j}je[m]\{i} h^o the m — 1 singleton subsets. 



3.1 The true complexity of linear forms 

The Cauchy-Schwarz complexity of C gives an upper bound on s, such that if is small 

enough for some function / : F^ — > B, then / is pseudorandom with regards to C. Gowers and 
Wolf [7| defined the true complexity of a system of linear forms as the minimal s such that the 
above condition holds for all / : F^ — > B. 

Definition 3.3 (True complexity |7J). Let C = {Li, . . . , L m } be a system of linear forms over F„. 
The true complexity of C is the smallest d € N with the following property. For every e > 0, there 
exists 5 > such that if f : F^ — > B is any function with \\f\\jjd+i < 5, then 

\tc(f)\<e. 
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An obvious bound on the true complexity is the Cauchy-Schwarz complexity of the system. 
However, there are cases where this is not tight. Gowers and Wolf [8] characterized the true 
complexity of systems of linear forms, assuming the field is not too small. For a linear form 
LeF;, let L k G F™ be the /c-tensor power of L. That is, if L = (Ai, . . . , A m ), then 



L 1 ' - I JjA i3 . :ii,...,i k G [m\ 



GF™ 



Theorem 3.4 (Characterization of the true complexity of linear systems, Theorem 6.1 in [8]). 
Let £ = {L\, . . . , L m } be a system of linear forms over F™ of Cauchy-Schwarz complexity s, and 
assume that s < p. The true complexity of C is the minimal d such that Lf +1 , . . . , Lf^ 1 are linearly 
independent over ¥ p . 

The following theorem is proved by the authors in |llj . 

Theorem 3.5 ([H]). Let C = {Li, . . . ,L m } be a system of linear forms of true complexity d and 
Cauchy-Schwarz complexity at most p. Then for every e > 0, there exists 5 > such that the 



following holds. Let fi,gi : F™ 



for 1 < i < m be functions such that \\fi — gi\\ud+i < $■ Then 



where X G (F™) fc is uniform. 



fl/i(^(X)) 



.i=i 



n^(^(x)) 



i=l 



<e, 



Theorem 13.51 is an immediate corollary of the following theorem. 

Theorem 3.6 ([llj. Let C = {L±, . . . ,L m } be a system of linear forms of Cauchy-Schwarz com 
plexity at most p. Let d > 0, and assume that Lf +l is not in the linear span of L^ +1 
Then for every e > 0, there exists 5 > such that for any functions fi, ■ ■ ■ , f m '■ F p 
ll/illt/d+i < £j we have 



jd+l 

D with 



E 



i=i 



< e, 



where X G (F") fc 



is uniform. 



3.2 Equivalence of systems of linear forms 

Let S be a subset of Fp. Here we think of S as a linear structure, and we are interested in the 
number of affine copies of S in a set A C F™. Pick a linear transformation T G Lin(Fp,Fp) 
uniformly at random, and also independently and uniformly a random element X G F™. Then 
the density of the structure S in A is the probability that X + T(x) G A for every x G S. Note 
that a uniform random T G Lin(Fp,F") can be defined by mapping each standard vector G F^ 
uniformly and independently to a point in F^. Hence if the elements of a linear structure S are 
(A^i, . . . , A^fc) G F^ where 1 < i < m, then the density of the structure S in A is the probability 
that for every 1 < i < m, X + J2 j= i Ki Y j e A, where X,Y 1 ,...,Y m are i.i.d. random variables 
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taking values uniformly in F™. So if we define the system of linear forms C = {L\, . . . , L m } by 
letting 

Li = (1, Aj 5 i, . . . , \k)i (11) 

for 1 < i < m, then tci^A) gives the density of the structure S in a set A C F™. Note that since 
for a fixed c G F™, a uniform random variable X has the same distribution as X + c, for this 
system of linear forms £, the distribution of (Li(X), . . . , L m (X)) is the same as the distribution of 
(L 1 (X) + c,...,L m (X) + c). 

Definition 3.7 (Homogeneous linear forms). A system of linear forms C = {Li,...,L m } in k 
variables is called homogeneous, if for a uniform random variable X € (F p l ) fc ; and every fixed 
c G Fp, (Li(X), . . . , L m (X)) /ias i/ie same distribution as (Li(X) + c, . . . , L m (X) + c). 

We wish to identify two systems of linear forms Cq = {Li,...,L m } in ko variables, 
and C\ = {L' ± , . . . , L' m } in k\ variables, if (Li(X), . . . , L m (X)) has the same distribution as 
(L'^Y), . . . , L' m (Y)) where X and Y are uniform random variables taking values in (F") fc ° and 
(F™) fcl , respectively. The distribution of (Li(X), . . . ,L m (X)) depends exactly on the linear depen- 
dencies between L±, . . . , L m , and two systems of linear forms lead to the same distributions if and 
only if they have the same linear dependencies. 

Definition 3.8 (Isomorphic linear forms). Two systems of linear forms Cq and C± are isomorphic 
if and only if there exists a bijection from Cq to C\ that can be extended to an invertible linear 
transformation T : span(£o) span(£i). 

Note that if C = {Li, . . . , L m } is a homogenous system of linear forms, then {L\ (X), . . . , L m (X)) 
has the same distribution as (Li(X) + Y, . . . , L m (X) + Y), where Y is a uniform random variable 
taking values in F^ and is independent of X. We conclude with the following trivial observation. 

Observation 3.9. Every homogenous system of linear forms is isomorphic to a system of linear 
forms in which there is a variable that appears with coefficient exactly one in every linear form. 

Consider a system of linear forms C in F^, and note that for every / : F£ — > C, 



tc(f) = IE 



n nT(L)) 

LLeC 



(12) 



where T is a random variable taking values uniformly in Lin(span(£), F"). Suppose that there 
exists a non-trivial subset S C C such that 

span(S') n span(£ \S) = {0}. 

Then for every / : FJJ -> C we have 

tcU) = ts(f)t C \s(f)- 

This leads to the following definition. 

Definition 3.10 (Connected linear forms). A system of linear forms C is called connected, if for 
every non-trivial subset S C C, we have 

span(5) n span(£ \ S) ^ {0}. 
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4 Higher- order Fourier analysis 



The characters of F™ are exponentials of linear polynomials; that is for a 6 FJJ, the corresponding 



character is defined as Xc 



e p(SILi a i x i)- I n higher-order Fourier analysis, the linear polyno- 



mials a i x i are replaced by higher degree polynomials, and one would like to express a function 
/ : F™ — > C as a linear combination of the functions e p (P), where P is a polynomial of a certain 
degree. 



Polynomials: Consider a function f : F™ — 
f in the direction y as A y f(x) = f(x + y) — i(x) 



F p . For an element y € F" define the derivative of 



Inductively we define A 



f=A, fc (A 



J), 



for directions € F" We say that f is a polynomial of degree at most d, if for every 



yu 



,Vd+i e Fp, we have A^, 



-,Vd+i 



f = 0. The set of polynomials of degree at most d is a vector 



space over F p which we denote by Poly^(Fp). It is easy to see that the set of monomials 
where < h, ■ ■ ■ ,i n < P and ]Cj=i ij — d form a basis for Poly d (F"). So every polynomial P € 
Poly d (Fp) is of the from P(x) := ^ <ki,...,in x% i ■ ■ ■ x n ' ■> w bere the sum is over all 1 < i\, . . . , i n < p 
with YlJ=i ij — d, and are elements of F p . The degree of a polynomial P : Fp — > ¥ p , denoted 

by deg(-P), is the smallest d such that P E Poly d (Fp). A polynomial P is called homogenous, if all 
monomials with non-zero coefficients in the expansion of P are of degree exactly deg(P). 

For a function / : F™ — > C, and a direction y € F™ define the multiplicative derivative of / in 
the direction of y as A y f(x) 



f(x + y)f(x). Inductively we define A 



yi,--;Vk 



/ = A yfc (A 



■■■,Vk 



for directions yx,---,yk € Fp. Note that for every f : Fp 



Fp, we have 



A y e p (f) = e p (Aj / f). 



This shows that if f £ Poly d (F"), then 



Aj,ep(f) = 1. 



(13) 



Note that one can express Gowers uniformity norms using multiplicative derivatives: 



&y u ...,yJ(X) 



where X, Y± , . . . , Yk are independent random variables taking values in F™ uniformly. This together 
with (fl~3j) show that every polynomial g of degree at most d satisfies ||ep(g)||(/d+i = 1. 

Many basic properties of Gowers uniformity norms are implied by the Gowers-Cauchy-Schwarz 
inequality which is first proved in [6] by iterated applications of the classical Cauchy-Schwarz 
inequality. 

Lemma 4.1 (Gowers-Cauchy-Schwarz). Let G be a finite Abelian group, and consider a family of 
functions fs'.G—>C, where SC. [k]. Then 



E 



n c \/>i..Y . ^v, 

sc[k] 



ies 



< n wish-, 

SC[k] 



(14) 



where X, Y\ , . . . , Y/- are independent random variables taking values in G uniformly at random. 

A simple application of Lemma |4. II is the following. Consider an arbitrary function f : G 
Setting /0 := / and fg := 1 for every S ^ in Lemma ITTj we obtain 



\E[f(X)]\<\\f\\ uk . 



(15) 
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Equation (fT5j) in particular shows that if f,g : — > C, then one can bound their inner product 
with Gowers uniformity norms of f~g: 

\(f,g)\<\\m\uk- (is) 

Consider an arbitrary / : F™ — > C and a polynomial g of degree at most d. Let g = e p (g). Then for 
every yi, . . . , y d +i G F™, we have 

which in turn implies that H/^H^d+i = ||/||[/<m-i- Combining this with (JT6J) , we conclude that the 
correlation of / with any polynomial of degree at most d is a lower-bound for ||/||[/d+i: 

ll/llu(Poly d ) < II /II [/■*+!• ( 17 ) 

This provides us with a "direct theorem" for the U d+1 norm: If ||/|| u (p iy d ) > S, then ||/||[/d+i > 5. 
Recently Bergelson, Tao, and Ziegler [U [TT] established the corresponding inverse theorem in the 
high characteristic case. 

Theorem 4.2 ([IJ [IT]). If 1 < d < p, then there exists a function 5 : (0, 1] — > (0, 1] such that for 
every f : F™ -)• D, and e > 0, 

• Direct theorem: // ||/||u(p iy d ) > ^en > £. 

• Inverse theorem: If \\f\\xjd+i > s, then ||/|| u (Poly d ) — 3( £ )- 

An important application of the inverse theorems is that they imply "decomposition theorems" . 
Roughly speaking these results say that under appropriate conditions, a function / can be decom- 
posed as fi + fz, where f\ is "structured" in some sense that enables one to handle it easily, while 
fi is "quasi-random" meaning that it shares certain properties with a random function, and can 
be discarded as random noise. In the following we will discuss decomposition theorems that follow 
from Theorem 14. 2^ but first we need to define the polynomial factors on F™. 

Definition 4.3 (Polynomial factors). Let p be a fixed prime. Let P\,...,Pc £ Poly rf (F"). The 
sigma- algebra on F^ whose atoms are {x € F™ : P\{x) = a(l), . . . , Pc(%) = a(C)} for all a € F^ is 
called a polynomial factor of degree at most d and complexity at most C . 

Let B be a polynomial factor defined by Pi, ... , Pq- For / : F? — > C, the conditional expectation 
of / with respect to B, denoted E(/|B) : F£ -»• C, is 

E(f\B)(x) = E {ye¥ n,p l(y)= p lix): _ jPc(y)= p c{x)} [f(y)]. 

That is, E(/|Z3) is constant on every atom of B, and this constant is the average value that / 
attains on this atom. A function g : F™ — >■ C is immeasurable, if it is constant on every atom of 
B. Equivalently, we can write g as g(x) = T(Pi(x), . . . ,Pc(x)) for some function r : F^ -> C. The 
following claim is quite useful, although its proof is immediate and holds for every sigma-algebra. 

Observation 4.4. Let f : F™ — > C. Let B be a polynomial factor defined by polynomials Pi, ... , Pc- 
Let g : F™ — > C be any B-measurable function. Then 

(f,g) = (E(f\B),g). 
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Definition 4.5 (Bias). The bias of a polynomial P € Poly d (F") is defined as 

bias(P) := bias(e p (P)) = \E x&? [e p (P(X))]\. 

We shall refine the set of polynomials {Pi, . . . , Pt} to obtain a new set of polynomials with the 
desired "approximate orthogonality" properties. This will be achieved through the notion of the 
rank of a set of polynomials. 

Definition 4.6 (Rank). We say a set of polynomials V = {Pi, . . . , Pt} is of rank greater than r, 
and denote this by rank(P) > r, if the following holds. For any non-zero a = (a±, . . . , at) € F* 
define P a (x) := Y2j=i ctjPj{x). For d := max{deg(P ? ) : ay 7^ 0}, the polynomial P a cannot be 
expressed as a function of r polynomials of degree at most d — 1. More precisely, it is not possible 
to find r polynomials Qi, ■ ■ ■ ,Q r of degree at most d — 1, and a function T : F£ — > ¥ p such that 

P(x) = T(Q 1 (x),...,Q r (x)). 

The rank of a single polynomial P is defined to be rank({P}). 

The rank of a polynomial factor is the rank of the set of polynomials defining it. 

The following theorem due to Kaufman and Lovett |12j connects the notion of the rank to the 
bias of a polynomial. It was proved first by Green and Tao [9] for the case d < p, and then extended 
by Kaufman and Lovett [12] for the general case. 

Theorem 4.7 (Regularity [I2])' Fix p prime and d>l. There exists a function r p d : (0, 1] — > N 
such that the following holds. If P : F™ — > F p is a polynomial of degree at most d with bias(P) > e, 
then rank(P) < r Pj ^(e). 

The following Theorem is proved by the authors in |llj . 

Theorem 4.8 (Strong Decomposition Theorem - multiple functions). Let p be a fixed prime, < 
d < p and m be integers, let 5 > 0, and let r : N — > N be an arbitrary growth function, and suppose 
that n > no(p,d,5,m,r(-)) is sufficiently large. Given every set of functions /1, . . . , f m : F™ — > D, 
there exists a decomposition of each fi as 

f i = hi + h' { , 

such that 

hi:=E(fi\B), \\h^\ ud+ i<S, 

where B is a polynomial factor of degree at most d, complexity C < C max (where C max depends on 
p, d, 5, m, r(-) ) and rank at least r{C). Furthermore we can assume that B is defined by homogeneous 
polynomials. 

The following proposition, proved in [llj, is one of the key ingredients in the proof of our main 
result, Theorem 11.31 

Proposition 4.9 (An invariance result |11|). Let p be a fixed prime. There exists a function 
r : N x (0, 1] — > N such that the following holds. Let V = {Pi, . . . , P^}, Q = {Qi, . . . , Qk} be two 
collections of polynomials over F™ of degree at most d < p such that deg(Pj) = deg(Qj) = d{ for 
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every 1 < i < k. Let C = {L±, . . . , L rn } be a system of linear forms, and T : — > D be an arbitrary 
function. Define f, g : F™ — > B fry 

f(x) = T(P 1 (x),...,P k (x)) 

and 

g(x) = T(Q 1 (x),...,Q k {x)). 
Then for every e > 0, z/ rank('P), rank(Q) > r(p,d,e), we have 

\t c {f)-t c {g)\<2p mk -e, 

provided that at least one of the following two conditions hold: 

(i) The polynomials Pi, . . . , P k and Qi, ■ ■ ■ ,Qk are homogenous. 

(ii) The system of linear forms C is homogenous. 



5 Characterization of strongly correlation testable properties 



Consider a family T> := {D n } n( zf>} where D n is a set of functions from F™ to B. We recall some basic 
definitions from the introduction. The correlation of a function / : F™ - 



u(D n ) = sup \(f,g)\. 

g€D n 



Given a function / : F™ — > B and a system of linear forms C = {L\ 
that the average of / over C, with conjugations a G {0, l} m , is 



with D n is 



,L m } in k variables, recall 



tc,a(f) - E XG (F«)fc 



where C is the conjugation operator. A family T> is said to be correlation testable with linear 
forms if there exists a set of linear forms L\, . . . ,Li along with conjugations a±, . . . , ct£, such that 
the collection of averages (tc l!ai {f), ■ ■ ■ ,tc e ,a e (f)) allows to distinguish whether / has noticeable 
correlation with D n or negligible correlation with D n . The true complexity (Cauchy-Schwarz 
complexity) of V is the maximal true complexity (Cauchy-Schwarz complexity) of {£i}i=i,.../. 

Definition 5.1 (Strongly correlation testable properties). A family V = {D n } n€ fq is strongly 
correlation testable by linear systems with true complexity d and Cauchy-Schwarz complexity s, 
if the following holds. For every e > 0, there exist 5 G (0,e), no G N, and systems of ho- 
mogeneous linear forms Ci, ...,£,£ in m\,...,mt variables, respectively, where each system has 
true complexity at most d and Cauchy-Schwarz complexity at most s, along with conjugations 
a\ G {0, l} mi , . . . , at G {0, l} m< such that the closures of the following two sets are disjoint: 

T e = {((t/W/), • • • , t Ck , ak (f)) : / : F™ -> B,n > n , ||/|| u(Dn) > e} 



and 



= {((t Cl , ai (/),-.., t Ck ,a k {f)) : f : K B, n > n , ||/|| u(Dn) < 5} . 
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A system T> = {D n } n ^ where D n is a set of functions from F™ to D is consistent if D n C D n+ ±, 
where we identify F™ with the subspace F™ x {0} of F™ +1 . In this section, we prove the following 
theorem. 

Theorem 5.2 (Main theorem: strongly correlation testable functions). Consider a consistent 
family V = {D n } n ^ . If V is strongly correlation testable with true complexity d and Cauchy- 
Schwarz complexity s < p, then there exists < t < d such that the following holds. Let (/„ : F™ — > 
B) n£ N be a sequence of functions. Then 

hm ||/ n -E[/ n ]|| u(Ai) =0<^ lim \\f n -E[f n ]\\ ut+ i=0. 

Define a set S C N to be the set of all degrees k > 1 for which the following holds. For every 
growth function r : N —¥ N, there exists no, a G N, such that for every n > uq there exist a 
polynomial P n over F" of degree exactly k such that 

(i) \\e P ( p n)\\ u (D n ) > 1/a; 

(ii) rank(P n ) > r(a). 

Claim 5.3. Unless all functions in V are constant functions, we have 1 G <S. 

Proof. Let g no G Z? no be a nonconstant function. There must exist a nonzero Fourier coefficient 
a G F™° such that 

Since we assume that the family V is consistent, for every n > uq, the function 

g n {x{l), • • • ,x(n)) = g no (x(l), . . . ,x(n )) 

belongs to D n . Let P n {x) = YH=i ot{i)x{i) be a linear function. For every nonzero linear function 
we have rank(P n ) = oo. By construction, each linear function P n has correlation with D n , 

\\e P (Pn)\\ u (D n ) > \(e p {P n ),gn)\ = \v\ > 0. 

□ 

The case where all functions in T> are constants is easy to analyze, as in this case we have that 

ll/IU(D„) = ll/llc/i, 

for all function / : F^ — > D. Thus, from now on we assume that there are some nonconstant 
functions in V, and hence 1 G S. 

Claim 5.4. 5C{l,...,s}. 

Proof. Let k > s, and assume by contradiction that for every growth function r : N — > N, there 
exists a, no G N, such that for every n > no, there exists a polynomial P n of degree k with 
||ep(-P n )||u(D n ) — V a an d rank(i- > n ) > r(a). We will show that in this case, the closures of T]_/ a and 
Si/ a are not disjoint for all a G N, which will yield a contradiction. Assume to the contrary that 
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they are disjoint for every a E N. Then for every a E N, there exists a minimal distance //(a) > 0, 
such that for every z' E T]/ a and z" E >?x/ we have 

Ik'-^'lloo >M(a)- (18) 

Applying Theorem 14.21 for s < p and Theorem 14.71 for fc, if we choose the rank bound r(a) large 
enough, we can guarantee that 

\\e p (P n )\\ us+ i <m(o)/2- (19) 

We first note that — (0, . . . , 0) is in Sy a for every a E N, since for / = we have tc, a (f) 
0. Combining this with (|18f) we get that for every sequence of functions f n : F" — > D with 
liminf^oo ||/ n |L(o„) > 1/a, we have 

liminf ||(i A>ai (/„),..., ^(/„))||oo >/"(«)• (20) 

n— >oo 

Consider now the polynomials P n . By Lemma 13.21 since each system Ci has Cauchy-Schwarz 
complexity at most s < p, we have 

\tc im (e p (P n ))\ < \\e p (P n )\\ Us+ i < /x(o)/2, 

for all 1 < i < Thus we reached a contradiction. □ 

We now define t := max(<S). Theorem 15.21 follows from the following two lemmas. 

Lemma 5.5. Let (f n : F™ — > D) n eN be a sequence of functions such that E[/ n ] = 0. If 
lim^oo ||/„||[/t+i = 0, then lim n ._ > . 00 ||/nL(D n ) = °- 

Lemma 5.6. Let (/ n : F™ — > B) ne pj 6e a sequence of functions such that E[/ n ] = 0. Tjf 

lim^oo ||/n||n(D n ) = °; then lim n^oo ||/n|[[/ t + 1 = °- 

We prove Lemma 1531 in Subsection 15. II and Lemma EH in Subsection 15.21 



5.1 Proof of Lemma 15.51 

Suppose that lm^-nx, ||/n|![/ t + 1 = 0, but 

c := limsup ||/„|| w (r> n ) > 0. 

Since T> is consistent, we can replace the limsup by an actual limit. Assume that c = 
linin^oo |j/ n |L(D n ) an d set e := c/2. Since V is strongly correlation testable with true complexity 
d and Cauchy-Schwarz complexity s < p, there exist 5 E (0, e), e' > 0, and a family of homogenous 
systems of linear forms Ci, . . . , C(, of Cauchy-Schwarz complexity at most s and true complexity at 
most d along with conjugations ai,...,ag such that 

||(*£i,ai(/), • • -,tc tm (f)) ~ (tCi,ai(g),-- ■ , tc e ,a e (ff)) ||oo > e', (21) 

for every : F™ -> D (with n > n ) satisfying ||/|| n(jDn ) > e and ||g|| u (£> n ) < 5. 

Let r : N — > N be a growth function to be defined later. We apply Theorem 14.81 and Theorem [33] 
to deduce that there exists a polynomial factor B n of degree s, complexity C n < C max (s, e, r(-)) 
and rank at least r(C n ), such that for h n := E(/ n |S n ) we have 

[*£<,*(/») -tA,ai(M<e72, (22) 
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for all 1 < i < I. Equations (|2ip and (|22p imply that for large enough n we have ||/in|L(_D n ) > 
So, for large enough n, there exists g n € D n such that 



|(^n,5n)| > S. 



(23) 



Let *B n be denned by polynomials Q n ,i, ■ ■ ■ ,Qn,c n and define Q nn := Ylf=i l{i)Qn,i{x) for every 
7 £ Fp . By choosing the growth function r(-) large enough, we have by Theorem 14.71 that for all 

[bias(Q n , 7 - Q n , Y )| < p" 2C ™ ,5/100. (24) 

As h n is ^-measurable, we can express it as h n (x) = F n (Q n> i(x), . . . , Q n ,c n {%)) for some F n : 
B. Consider the Fourier decomposition of F n , 



F n (z 1 ,. . . ,Z Cn ) = F n(l) e P 5^7( 



«2i 



v« = l 



We thus have 



^n(x) = ^ F n (7)e p (Q nj7 (x)), 



where |F n ( 7 )| < 1. Define W n := {7 e : deg(Q n , 7 ) < t}. We now show that the assumption 
linin.j.oo ||/ n ||[/t+i = implies that by taking n large enough, we can make |-F n ( 7 )| arbitrarily small 
for all 7 £ W n . 

Claim 5.7. For large enough n, we have \F n {^)\ < p~ Cn S/10 for all 7 S W n . 
Proof. By expanding (h n ,e p (Q nn )) we have 

F n (rf) = (hn,e p (Q n ,-y)) - ^ -^n(V) ' bias(<3 nj7 / - Q„, 7 ). 

7V7 

We bound each term individually. As e P (Q n ,-y) is B n -measurable and h n = K(f n \B n ), we have that 

\hn , Bp \Qn,y) ) = {fni^p{Qn,'y))i 

and since deg(Q raj7 ) < t we have 

|(/in,e p (Q„, 7 ))| < H/nll^+i < p" Cmax <5/100, 
for large enough n. By (f24"|) and the bound H-P^Hoo < 1, we conclude that 

1^(7)1 <P~ Cn S/W, 

for all 7 G W„. □ 
Now, Claim [57T1 implies that 



£ F n ( 7 )(e p (Q 
■y&W n 



< 5/2. 
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However, since \(h n ,g n )\ > 5 we must have that there exists 7* g" W n such that 

\(e P (Qn,r),9n)\ >P~ Cn 5/2. (25) 

We now show a contradiction to the assumption that t = max(<S). Set P n := Q n ,-y*- As Q n ,-y ^ W n , 
by [25] we have that 

• t + 1 < deg(P„) < s; 

• \K(Pn)L(D n ) >p- C S/2; 

• rank(P n ) > rank(£>„) > r(C n ). 

Let n\ < ri2 < ■ ■ ■ be an infinite sequence such that t! := deg(P„ 4 ) > t + 1 and C n = C. Since 
the family 2? is consistent, we may assume (by refining the sequence) that deg(P n ) = t' and 
rank(i- > „) > r{C) for all n £ N. As r(-) is an arbitrary growth function, we must have t' € S and 
the lemma follows. 

5.2 Proof of Lemma 15.61 

Let (f n : F™ — > D) ne N be a sequence of functions such that lim n _ >oc ||/n|L(r> n ) = 0) an d assume by 
contradiction that 

c := limsup ||/n||c/t+i > 0. 

n— >oo 

Since T> is consistent we may replace the limsup by an actual limit by refining the sequence. So 
we assume that ||/n||[/ t + 1 > c for all large enough n > uq. By Theorem 14.21 this implies (since 
t < s < p) that there exist polynomials Q n G Poly t (Fp) such that 

\(fn,e p (Q n ))\>5, 

for some 5 = 5(c) > 0. We first regularize the polynomials to have arbitrarily large rank. 

Claim 5.8. There exists 1 < to < t such that the following holds. For every growth function 
r : N — > N there exists b G N and an infinite subsequence m < 112 < ■ ■ along with polynomials 
Qni of degree exactly to, such that 

• K/n^e^))! > 1/6; 

• rank(Q ni ) > r(b). 

Proof. We first claim that it is enough to prove the claim where to may depend on r(-). Otherwise, 
assume that for every r(-) there exists a to(r) for which the claim holds, but there is no single to 
which holds for all r(-). Then, for each possible 1 < i < t, let rj(-) be a growth function for which 
the claim does not hold for to = i and rj(-). Set r = max(ri, . . . , r^), and we reach a contradiction. 

Thus, it is sufficient to show that for every growth function r(-), there exists 1 < to < t for 
which the claim holds. Applying Theorem 14. 7\ for every growth function r : N — >• N, we can express 
each polynomial Q n as a function of C n < C max (t, r(-)) polynomials Q' n l , . . . , Q' n c of degree at 
most t where Tank(Q' n 1 , . . . , Q' n c ) > r(C n ). Assume that Q n (x) = F n (Q' nl , . . . ,Q' nC ). Let 

Q'n,-y = Ylf=i l{i)Q'n i( x ) f° r 7 ^ ^p™ • *he Fourier decomposition of F n , we have 

e P (Qn(a:)) = Yl F ^) e P {Q'n,~,( x )) ■ 
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Since ||-F n ||oo < 1, we have that for every n there exists 7„ such that 

\{fnMQ'n, ln ))\> 5 P~ Cn - 

Since we assumed E[/ n ] = 0, we cannot have that Q' nan is a constant, thus we have deg(Q' n ) > 1 
and hence rank(Q' n ) > r(C n ). Let to be a number which is the degree of Q' nnn for infinitely 
many n. Since r(-) is arbitrary, the claim follows. □ 

We fix 1 < to < t given by Claim [ST8l Let r : N — > N be a growth function to be determined later. 
Since V is consistent, we can refine the sequence (/ n )neN to the subsequence given by Claim [5\8l 
and obtain there exists function /„ : F™ — > B with lim n _ i . 00 ||/n||n(D„) = 0, and polynomials Q n of 
degree exactly to, such that for every n > no, 

• \{fnMQn))\ > 1/b] 

• rank(Q n ) > r(b). 

We first derive a contradiction when to £ S. 

Lemma 5.9. If t G S, then limsup^^ ||/ n || u (D n ) > 0. 

Proof. In the proof, we think of p, s as constants and do not explicitly mention dependencies on 
them. Given the value of b, let : N — > N be a growth function to be determined later, where we 
will in particular have ri,(a) > r(a), for all a, b G N. Since to £ S, there exist polynomials P n of 
degree exactly to, functions g n £ D ra , and no, a € N, such that for every n > no, 

• \{e p (Pn),gn}\ > 

• rank(P n ) > r b (a). 

Set e := 10 oo a a b - Since P is correlation testable with true complexity d and Cauchy-Schwarz 
complexity s < p, there exist J G (0,e), r] > 0, and a family of homogenous systems of linear 
forms Cx, of true complexity at most d and Cauchy-Schwarz complexity at most s, and 

conjugations a±, . . . , ct£, such that 

||(*A,«i(/), • • • >*W/)) " (*A,«i(/0, • • • ,*W/0)lloo > 

for every /, /' : F" — > D (with n > no) satisfying ||/'|| u m n ) > e and ||/|| u m n ) < 5. We will prove the 
lemma by constructing a new sequence of functions {f n : F" — > B) ng N such that for large enough 
n, we will have 

• \{fn>9n)\ > e and hence ||/4L(z? n ) > e; 

• |iA,a,(/n) - *A,ai(/;)| < V, for all 1 < i < k. 

This will conclude the lemma as it will show that ||e p (/ n )|| u (£> n ) > 5 for large enough n. 

Let r\ : N — » N be a growth function to be determined later (whose choice depends on the 
values of a, b). By Theorem 13.61 there exists a polynomial factor B n of degree s, complexity 
C ra < C max (?7, a, 6, n(-)) and rank at least r\{C n ), such that for h n := K(f n \B n ) we have 

1^ (/in) | < r7/10, (26) 
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for all 1 < i < k, and also 

Wfn-KWus+x < 1/26. (27) 

Recall that Q n is a polynomial of degree t$ < t < s such that |(/ n , Q n )\ > V^j f° r ra > n o- By 
the choice of ( 1271 ) we hcive tlictt h n is ctlso correlated to Qn: &s 



|(/i„,e p (Q„))| > K/„,ep(Q n ))| - |(/ n - fcn,ep(Q„))| > ^ - ||/„ - Ml^+i > 1/26. (28) 

Assume B n is defined by polynomials Q' nl , ... , Q' njCn - Define Q' nry = Ya=i 7( i ) ( 2n,i for 7 € ^p n ■ 
The function /i ra = E(/ n |i3 n ) is ^-measurable, hence h n = F n (Q' n 1 , . . . , Q' n c ) for some function 
F n : F^ n ->■ ID), and we have 

K{x) = E F n{l)Zp(,Q'nv( x )) 

where ||-F n ||oo — 1- Thus we have 



1/26 < 



< E 1^(7)1 -|bias(Q; i7 -Q n )|. (29) 



We now show that when ri(-) is chosen large enough (as a function of a, 6), then almost all 
the contribution to the correlation in (]29p comes from a single term 7*. Let r\(C) be chosen large 
enough, such if R is a polynomial on F™ of degree at most s and rank at least ri(C)/2, then for 
large enough n we have 

|bias( J R)| < e ■ p~ C . (30) 

Such a choice is guaranteed by Theorem l4.71 Assuming such a choice for n(-), we have the following 
claim. 

Claim 5.10. There exists 7* G F^ n suc/i that |-F n (7*)| > 1/46 and |bias(Q' » - Q n )\ > 1/46. 

7 



Moreover, for all 7^7* we have |bias(Q^ 7 — Q n )\ < £ • P 



Proo/. Assume first that rank(Q / , 7 - Q n ) > ri(C n )/2 for all 7 € F^ 7 ' 1 . This implies by ([30]) that 
|</»n,ep(Q„)>|< E l^(7)l|bias(Q^ 7 -Q n )| <e< -i-, 



which is a contradiction to (|28p . So there must exist at least one 7 such that rank(Q^ 7 — Q n ) < 
f\{C n )/2. Assume there were two distinct 7' 7^ 7" such that rank(Q^ 7 , — Q n ) < ri(C n )/2 and 
rank(Q^ ,, — Q n ) < r\(C n )/2. Then by subadditivity of rank, this implies 

rank(Q^ 7 ,_ 7 „) = rank((<Xy - Q n ) - (Q nn » - Q n )) 

< rank(Q^y - Q n ) + rank(Q ni7 » - Q n ) < r 1 (C n ), 

which is a contradiction to the fact that {Q n ,i, ■ ■ ■ , Qn,c„} has rank at least r\{C n ). So, there exists 
a unique 7* G F^' 1 such that rank(Q^ r - Q n ) < n(C n )/2. Thus by (J3DJ) we have 

E l^(7)l|bias(Q' n , 7 - Q n )| < e < 1/106. 

7^7* 
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However, by (f28|) we have \{h n ,e p (Q n ))\ > 1/26. Hence 

|^(7*)||biaB(g / n>r -Q n )| > 1/46. 

The claim follows as |F n (7*)| < 1 and \bias(Q' nrf , — Q n )\ < 1- EH 

The next claim shows that if we require ri(-) and r(-) to be large enough, then we can guarantee 
that deg(Q^ r ) = deg(Q n ). 

Claim 5.11. 7/ri(-),r(-) are chosen large enough, then 

deg(Q^ i7 *) = deg(Q n ). 

Proof. Recall that we have by assumption rank(Q n ) > r(b) and rank(Q' ») > r\(C n ). We also have 
|bias(Q' * — Q n )\ > 1/46 by Claim [57X01 which by Theorem 14.71 implies that rank(Q' » — Q n ) < 
r'(6), for some function r'(-). We have two cases to consider: 

• If deg(Q n ) < deg(Q^ j7 *), then we have rank(Q^ 7 ) < r'(b) + 1. To avoid this case we choose 
n(C) > r'(b) + 1 for all C G N. 

• If deg(Qn) > deg(Q^ j7 *), then we have rank(Q n ) < r'(b) + 1. To avoid this case we choose 
r(6) > r'(6) + 1 for all 6 G N. 

If these two conditions are met, this guarantees that deg(Q^ j7 *) = deg(Qn) as claimed. □ 

We thus have deg(Q' n/y ,) = deg(Q ra ) = to- We now claim that without loss of generality 
we can assume 7* = e\ = (1,0, ... ,0). This can be achieved by performing an invertible linear 
transformation on the basis Q' n 1; . . . , Q' n c which preserves the polynomial factor defined by it 
as well as its rank. Summarizing Claim 15.101 and Claim 15.111 and the discussion above, we have 
established the following properties: 

1. \Fn(ei)\ > 

2. |bias(Q^ 1 - Q„)| > ^ and |bias(Q^ 7 - Q n )\ < e ■ p~ c " for all 7 / e x ; 

3. deg(Q' nl ) = deg(Q n ) = t ; 

4. xank{Q' n l ) > n(C n ) and rank(Q n ) > r(6). 

We now repeat the same process for g n . Let A — sp ^ max . There exists a polynomial factor 13 f n 
of degree s and complexity C' n < C' max defined by polynomials {P^ i : 1 < i < C' n }, such that for 
g' n := E(g n \B' n ) we have 

Wdn - g'nWu^ 1 < K 

and also that g' n = G n {P' n 1 , . . . , P' n c , ) where 

1. \G n {ei)\ > i; 

2. |biaa(J* J - P n )\ > £ and |bias(P^ i7 - P n )\ < e ■ p~ c ™ for all 7 ^ e x - 

3. deg(i* J = deg(P„) = t ; 



25 



4. rank(P^ 5l ) > ri(C' n ) and rank(P n ) > r b (a). 

We now define a new polynomial factor B n as follows. Let R n ,2, • • • ,R n ,c n De random polyno- 
mials in F™ chosen such that deg(R nt i) = deg(P^j). We note that for every rank bound r*, for 
large enough n the following holds with high probability. Let P 1 be some (possibly zero) linear 
combination of P n ,P^ x , . . . , P' n c , and let R 1 be a nonzero linear combination of Rn,2, •••■> Rn,c n - 
Then as long as deg(R 1 ) > deg(P 1 ) we have that rank(P x + R 1 ) > r*. 

We define B n = {P n , P n , 2 , R n , Cn }- Define P n , 7 = l(l)P n (x) + E?=2 j(i)R n ,i( x ) f ° r 7 £ F^ 7 ™ . 
We define the new sequence of functions as 

ft l (x) = F n (P n ,R n>2 ,...,R n>Cn ). (31) 

We conclude the proof by showing that \tc i)ai (fh) ~ tdmifn)] < f? for all 1 < z < £, but that 
\{f n ,9n)\>e. 

Claim 5.12. \t Ci , ai (f n ) - I < V for all 1 < i < t. 

Proof. Note that deg(R n ^) = deg(P nj j) for all 1 < i < I. Since all the linear forms are homoge- 
nous, we can apply Proposition 14.91 By the proposition, there exists a bound r'{r\) such that if 
rank(S n ), rank(£? n ) > r' then the claim follows. To ensure this, we require for B n that r\{C) > r' 
for all C € N; and for B n that r* > r' and that r&(a) > r' (note that it is crucial to allow r to 
depend on both a, 6, since r\ depends on both a, b). □ 

Claim 5.13. \{f' n ,g n )\> e. 

Proof. We first claim, it suffices to prove \(f n ,g' n }\ > 2e. Indeed 



\(fn,9n- 9n)\ < \ F n{l)\\( G p{ R na) , 9n ~ 9 n )\ < P^hn ~ g'n\\u°+^ < ? , 

where we used the fact that deg(P nj7 ) < s. Now note that 



(fn,9n)= F n {"j)G n {i)b\as(R nn - P' nn ,). 



Consider first the term 7 = 7' = e±. We have R n , ei = Pn and by our construction, 



1 1 1 

46 4a 4a 



i ? n(ei)G„(ei)bias(P„-P^ 1 ) 

We now bound all other terms (7,7') 7^ (ei,ei). By choosing ri(-),r* large enough, we can bound 

|bias(P n , 7 - i* y )| < e 

Putting all these together we conclude that \(f n ,g n )\ > £ as claimed. □ 
This concludes the proof of Lemma 15.91 □ 
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Lemma [5.91 shows that if to £ <5, then we are done. We will next show that S = {1, . . . , t} which 
will conclude the proof of Lemma 15.61 Consider any t' < t and systems of linear forms Ci, ■ ■ ■ , Lg. 
For a function / : F™ — > E we shorthand tc(/) = tc, a (f) for all conjugates a since / = /. In 
Lemma l5.14l b elow we shall show that it is possible to construct two different families of functions 
f n ,f n :E2-> [-1,1] such that 

(i) / n and cannot be distinguished by averages i/v; 

(ii) / n has correlation with polynomials of degree exactly i and of arbitrarily high rank; 

(iii) f' n is a linear combination of a bounded number of exponentials of polynomials of degree 
exactly t' and arbitrarily high rank. 

We note that the combination of (i), (ii), (iii) implies that t' € S: Condition (ii) im- 
plies by Lemma 15.91 that limsup^^^ ||/n|U(D„) > 0- By Condition (i) this implies that also 
limsup n _, 00 \\f n \\ u {D n ) > 0; so there exist g n € D n such that lim sup^^ \(f' n ,g n )\ = 5 > 0. By 
Condition (iii) we can express f' n (x) = Ylf=i a i e p(Qn,i( x )) where C is a uniform bound, |o$| < 1 
and Q Hy i are polynomials of degree t' and arbitrarily high rank. Hence, we must have for infinitely 
many n that |(e p (Q n> j), g n )\ > 5p~ c for some i. Since T> is consistent, we can extend this to all 
large enough n and complete the proof. 

It only remains to prove the following lemma. 

Lemma 5.14. Let £i,...,Ci be systems of linear forms. Let if < t. There exists functions 
f n , f' n : Fp — > [— 1, 1] and a constant C £ N such that the following holds: 

(i) For every % & [£], 

tc i (fn)=tc i (fn)- 

(ii) There exist polynomials P n : Fp — > ¥ p satisfying deg(P n ) = t and linin-yoo rank(P n ) = oo, and 

liminf \(e p (P n ),f n )\ > 0. 

n— ¥oo 

(Hi) f' n is a linear combination of exponentials of C high rank polynomials of degree exactly t'. 
That is, there exist polynomials Q n ,i, ■ ■ ■ , Qn,c : F^ F p satisfying deg(Q nj «) = t' and 
limn-j.oo rank(<5n,i) = oo for all 1 <i<C, and 

C 
i=l 

where \cti\ < 1. 

Proof. We first note that it is sufficient to prove the lemma for systems of linear forms which are 
non-isomorphic and connected, since we can decompose each system to its connected components 
and remove isomorphic copies. 

First let us introduce some notations. For positive integers m > n, let 7r m _> n denote the natural 
projection from F™ to F^ defined as 

^rn^n • (^1 j • • • j •^m) 1 ^ 0^1 j • • • ! x nJ • 
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For functions / : F™ ->• C and g : F£ ->• C, let / ® g : ¥ m+n -> C denote the function 

; ^m+n) — /(-^l j ■ ■ ■ > •^m)5 , (-^m+l > • • • > ^m+n) j 

and note that for every system of linear forms C, we have 

tdf®g)=t c {f)t c {g). 

By Theorem 17.11 there exist a constant iV € N, an e > 0, and a function F : F^ — >■ [0, 1] such 
that 

{^K' I lk-(t£ x (n--- 5 ^(^))lloo<e} C {(t A (/),..., t A (/)) | / : F^ ->• [0,1]} . (32) 

Consider two sequences of polynomials P n ,Qn '■ F" — > F p such that deg(P n ) = i and deg(Qn) = i', 
and 

lim rank(P n ) = lim rank(Qn) = oo. 

n— yoo n— yoo 

Define g n : F™ — > {— 1/p, 1 — 1/p} as g n {x) = 1 — 1/p if and only if Q n {x) = 0. Note that 



9n(x) = -e P (aQn(x)). 



aGF p \{0} P 

Define h n : F£ -> {0, 1} as /i n (x) = 1 if and only if _P n (x) = 0. 

Let (5 > be sufficiently small so that for every m > N, and every i € [A;], 

I^^P) " i^OWlm + (1 " 5)^ O 7T m ^ N )\ < e/2. 

Then by (I32D . for every m > N, there exists a function G m : F^ — > [0, 1] such that for every i € [&], 

tCi(G m ) = t Cl (5h m + (1 - £)F o 7r m ^Ar). (33) 
For n > 2N, let m = [n/2\ and define /„, f' n : F™ -> [-1, 1] as 

f n := #„_ m <g> (£/i m + (1 - 5)F o ir m ^ N ), 

fn := 9n—m ® (G m O 7T m _>7v)- 

By (|33p for every z € [fc] and every n > 2N, 

tCi(fn) = tCi(gn-m)tCi(fih m + (1 - S)F o n m ^. N ) = t c ^g n _ m )t Ci {G m o it m ^ N ) =t Ci {f' n ), 

which establishes (i). To establish (ii), let R n := Q n - m <g> P m . Note that R n is a polynomial of 
degree t and lim n ^.oo rank(i? ra ) = oo. We have 

{fn,e p (R n )) = (g n—m ; 

(Q n—m )) • (<5^m + (1 - <8> 7r m ^AT,e p (P m )). 
We now lower bound the terms. By the definition of g n -m, we have 



I \9n—mi &p(Qn—m)) | 



— I — bias((a — l)Q r 

p p ^— ' 

aGF p \{0,l} 



1 

- 2p' 
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for large enough n — m since rank(<5 n -m) oo. The function F (g> ir m -^N depends only on the first 
./V variables; hence we have lim m ^. 0O (F(8)7r m _ i ,7v ; e p (P m )) = since rank(P m ) — >• oo by Theorem l4.7l 
Finally, by the definition of h m we have 

1 

2p 

for large enough m, since rank(P m ) — > oo. We thus conclude that 

1 

Ap 2 

for large enough n, which establishes (ii). To conclude the proof we establish (iii). Let 



(h m ,e p (P m )) = Pr x [P m (X) = 0] > — 



\{fn,e p (R n ))\>-^ 



G m {x) = Y, G(7)ep [y^) x( A 

7GF^ \i=l J 



where |G(7)| < 1. We thus have 

f n {x) = g n - m (x(l),...,x(n-m))G m (x(n-m + l),...,x(n-m + N)) 

= Y - G (l) e p I «Qn-m(^(l), • • • , x{n - m)) + y \ j(i)x(n -m + i)\ . 

7eF^,aGFp\{0} P \ i=X J 

Hence, we can express f' n as the linear combination of C = (p — l)p N exponentials of polynomials 
of degree exactly t'; and as N is fixed and n — m — > oo, their rank is unbounded. This established 
(iii) and concludes the proof. □ 

6 Characterization of correlation testable properties 

Consider a family T> := {D n } n ^ where D n is a set of functions from F" to ¥ p . Given a function 
f n : F™ — > F p , we want to probabilistically determine whether f n has correlation with D n . That is, 
we want to determine if ||ep(f n )|| u m n ) is non-negligible or not, where we are only allowed to read 
the value of f n on a few points. We recall the definition of correlation testable properties. 

Definition 6.1 (Correlation testable properties). A family T> = (D n ) is correlation testable with q 
queries, if there exists a distribution \i taking values in (F™) 9 and a mapping T : Fp — > {0, 1}, such 
that the following holds. For every e > 0, there exist 5 E (0, e), < 9~ < 9 + < 1 and uq £ N, such 
that for every n > uq and f : F^ — >■ ¥ p we have: 

• // l|e P (f)L ( D„) > e then Pr (Xlj ... ) ^ ) ^[r(f(X 1 ), . . . , i(X q )) = 1] > 0+. 

• // l|e P (f)L ( D„) < S then Px {Xu ... jXq) ^\r(i(X 1 ), . . . ,f(X q )) = 1] < 0- . 

We study proper dual families T>. We recall that a family T> is proper dual if the following 
conditions hold: 

• Al: Consistency For positive integers m > n and g E D n , the function h : FL" — > ¥ p defined 
as h(xx, . . . ,x m ) = g(xi, ... ,x n ) belongs to D m . 

• A2: Affine invariance For every positive integer n, if g E D n , then for every A E Aff (n, F p ), 
we have Ag E D n . 

• A3: Sparsity For every e > and large enough n, we have \D n \ < p £pn . 
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6.1 Correlation testing by averages over linear forms 



We first show that if V is a proper dual family which is correlation testable using q queries, then 
it is in fact also testable using averages of linear forms. When arguing about functions to F p , one 
may allow more general types of averages. Let C = {L\, . . . , L m } be a system of linear forms in k 
variables. Let j3 G ¥™ be a vector of coefficients. Recall that for a function f : ¥ p — > F p , we define 
the average 



*£,/3( f ) - E Xe(F£)* 



e p X>« f ( L *( X )) 



vi=l 



We note that for functions f : F™ — >■ ¥ p , these averages generalize the previous averages tc, a which 
were defined for bounded functions. Indeed, for a G {0, l} m let /3 G {—1, l} m be defined as 
P(i) = then 

Lemma 6.2. Suppose that a proper dual family V = {D n } is correlation testable with q queries. 
Then for every e > 0, there exists 5 G (0,e), no G N, and homogeneous systems of lin- 
ear forms Ci, ...,£■£ with mi,...,m,£ linear forms, accordingly, and corresponding coefficients 
Pi G F™ 1 , . . . ,/3i G ¥ p ne , such that the closures of the following two sets are disjoint: 



Te := m i>Pl (f), t* cat (f))\n > n , f : F-^ F p , ||e p (f)|| u(Dn) > e}, 



and 



Se := m ufil {f), • • • , t* Ce ^(f))\n > n , f : F£ -> ¥ p , ||e p (f)L (Dn) < 6}. 
Moreover, the systems C\, . . . ,£e have Cauchy-Schwarz complexity at most q — 2. 

Proof. Since D n is a proper dual, by Condition A2, for every f n : F" — > ¥ p and every A G Aff(n, ¥ p ), 
we have that ||e p (ylf ri )|| ?t (£i n ) = ||ep(f n )|| u m n ). Let A G Aff(n,F p ) be a uniform random invertible 
affine transformation. Then by the assumption that T> is correlation testable, we have that 

• If ||e p (f n )|| u(Dii) > e, then Pr (Xl) ... )Xfl) ^ j7lgAff(niFi)) [r(f n (AYi), . . . ,f n (AX q )) = 1] > 9+. 

• If IMk)||„(£,„) < <*> then Pr(X ll ... ) X,)~ / x„A6A£F(n,F p )[r(^(AYi),...,^(AY,)) = 1] < 0". 

We establish the lemma by showing that if we set no large enough, then the probability 



Pr 



,A e Aff(n,F p )[r(f(,4AU . . .,f(AX q )) = 1] 



can be approximated with an arbitrarily small error by linear combinations of t*^ ^ (f ) , . . . , t£ ^ (f ) , 
where {(A, fii)}i<i<e are all possible homogeneous systems of at most q linear forms. Note that £ 
is a constant depending only on p, q, and that the Cauchy-Schwarz complexity of any homogeneous 
system of t < q linear forms is at most t — 2 < q — 2. We start by decomposing T to its Fourier 
decomposition 

T(z(l), . . .,z(q)) = £ f (7)ep f^W " . 

7 eF? \i=i / 

We thus have that 

Pr ( x 1 ,...,x 9 )^,^Afr(n,F p )[r(f(^ 1 ), . . . ,i{AX q )) = 1] 



7GF? 
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where the expectation is taken over (Xi, . . . , X q ) ~ \i and A ~ Aff(n,F p ). Thus, it is enough 
to show that each term E [e p (^f =1 f3(i) ■ f(AXi))] can be approximated by linear combinations of 

( f )}l<i<fc- 

Fix (xi, . . . , x q ) € (F p ) 9 . Suppose that the rank of spanjxi, . . . , x q } over F p is r. Let yi, . . . ,y r G 
F p form a basis for span{:ri, . . . , x q }, so that Xj = X^j=i ^i,jVj' f° r every 1 < « < g. Then the distri- 
bution of (Aci, . . . ,Ax q ) is the same as the distribution of (Yo + Yfj=i ^ij^j, ■ ■ ■ ,^o + X^j=i ^g,jYj), 
where Yq, Yi, . . . , Y r are i.i.d. random variables taking values in F p uniformly at random conditioned 
on Y\, . . . , Yi being linearly independent. However since if we pick Y±, . . . ,Y r independently and 
uniformly at random, with probability 1 — 0^-5.00(1) they will be linearly independent, by tak- 
ing n to be sufficiently large this distribution can be made arbitrarily close to the distribution of 
(Yq + Ylj=i ^,jYj, . . . , Yq + Y2j=i ^qjYj), where Yq, . . . , Y r are i.i.d. random variables taking values 
in F™ uniformly at random. 

Thus, we can approximate each term E [e p (Y^i=i j3[i) ■ i{Axi))\ by 
E Gp(Xrf=i /^(*) ' KYo + Y^j=i ^ijYj)) , which is one of the averages t* c _ g _{i). We now con- 
clude the proof, since E [e p (^? =1 (3(i) • f(AX»))] where (Xi, . . . ,X q ) ~ \x can be approximated by 
an appropriate weighted average of t* c , a (f ) , . . . , f£ B (f ) . □ 

6.2 From field functions to distributional functions 

The next step is to move from functions f : F™ — > ¥ p to functions whose output lies in some 
convex set. Once this is accomplished, we can use the same techniques used for studying functions 
/ : F™ — > B, there were used in Section 

Let -P(Fp) denote the family of probability measures over F p . That is, -P(F p ) C W is given by 

P(¥ p ) = {// : F p [0, 1] : £ //(c) = 1}. 

cGFp 

We identify every element c £ ¥ p with its corresponding dirac measure on ¥ p . That is c G F p is 
corresponded with the probability measure \i c where /U c (c) = 1 and // c (c') = for all c' 7^ c. We 
refer to functions r : F™ — > P(¥ p ) as distributional functions. Note that they are a superfamily of 
functions from F p to ¥ p , which can be regarded as deterministic functions. Given a distributional 
function T, we identify it with a distribution over functions from F™ to ¥ p : If F ~ T, the value F(x) 
is independently chosen for every x € F p according to the distribution F(x). 

We extend the notion of averages t£ g to distributional functions T. For c € F p define the 
function a c : P(F p ) — > B to be 

o c (/x) = E 2 ^[e p (c • z)]. 

For a distributional function r : F™ — > P(¥ p ) we consider the functions a c o T : — >• B, which can 
equivalently be defined as 

(Ocor)(x) =E F ^ r [ep(c-F(x))]. 
Let C = {L\, . . . , L m } be a system of m linear forms in k variables, and let f3 £ ¥ p n . We define 



*£„3( r ) — E Xe (F«)fe 



or)(L f -(x)) 



Lt=l 

Note that for functions f : F p — > F p this definition identifies with our previous definition. 
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Claim 6.3. Let C = {L±, . . . ,L m } be a system of linear forms in k variables. Let (3 € F™ be a 
vector of corresponding coefficients. Then for every distributional function T : F™ — > P(¥ p ), and 
every e > 0, we have 



Pr F , 



t*cAF)-tl (T) 



< e 



l-On(l). 



Proof. The proof follows by a first and second moment estimation, and then applying Chebyshev's 
inequality. Let F ~ V. Fix x € (F™) fc , and consider the random variable 



A(x)=e p ^/3«F(^(x))j . 
We have t* c ^(F) = X^xe(F n ) fe ^( x )- Note that when Li(x), . . . , L m (x) are all distinct, we have 

n 

E F „ r [A(x)] = [[(a^orj^fx)). 



i=l 



Thus, we get that 

|EF~r[*^(F)]-t^(r)|<P [^i(X) 



, L m (X) not all distinct] < m p n , 



where the second inequality follows by the union bound. We now bound the variance of 
i£^(F). Note that two random variables A(x'), A(x") are independent if {Li(x'), . . . , L m (x')} 
and {Li(x"), . . . , L m (x")} are disjoint. We thus can bound 

Varp^rK^F)] < Pr x , x » e(F n )t [{Li(x'), . . . ,L m (x')} D {Li(x"), . . . ,L m (x")} ^ 0] < ^V", 

where the second inequality follows from the union bound. The claim follows for Chebychev's 
bound. □ 

We extend also the notion of correlation to distributional functions. We shorthand e p (r) := 
Oi o T, and consider 

l|ep( r )llu(D„) = sup |(ep(r),ep(g))|. 

Claim 6.4. Let V = {-D n }neN be a proper dual family. Then for every distributional function 
r : W™ — > P(F p ) and any e > we have 



Pr F , 



|e P (F)|| u(Dn) - ||e p (r)|| u(Dn) 



> e 



On(l). 



Proof. Fix g G D n , and consider the random variable (e p (F), e p (g)) = ^ Ylxew™ e p(^( x ) ~ s( x ))- 
Its expected value is (e p (r), e p (g)), and since the values {F(x) : x € F™} are chosen independently, 
we can apply Chernoff 's bound and get that 

Pr F .r [Kep(F),ep(g)> - (e p (r),e p (g))| > e] < 2e~^ 

for some constant c = c(e) > 0. By the sparsity Condition A3 we get that for every c' > there 
exists no, such that for every n > uq we have \D n \ < p cp . We conclude the proof by choosing d 
small enough such that p c < e c , and apply the union bound over all g <G D n . □ 
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We thus obtain the following lemma, which allows us to consider distributional functions instead 
of field functions. 

Lemma 6.5. Suppose that a proper dual family T> = {D n } is correlation testable with q queries. 
Then for every e > 0, there exists 5 S (0, s), uq € N, and homogeneous systems of lin- 
ear forms C\ , . . . , Ce with mi , . . . , mi linear forms, accordingly, and corresponding coefficients 
/?i E ¥ p ni , ...,/% e W£ l , such that the closures of the following two sets are disjoint: 

Te := m iA (T), . . . ,t* Ce ^(T))\n > n ,T : ¥ n p ^ P(¥ p ), ||e p (r)|| u(I?n) > e}, 

and 

Se ■= m uPl (T), . . .,t* CtA (T))\n > n , T : F™ — > P(¥ p ), ||e p (r)L (Dn) < 5}. 
Moreover, the systems C±, . . . ,Cg have Cauchy-Schwarz complexity at most q — 2. 

Proof. Apply Lemma 16.21 for T> and e/2. There exist 5 6 (0,e/4) and systems of linear forms 
Cl,...,£i with mi, . . . ,mi linear forms, accordingly, along with coefficients /3i € ¥ p ni , . . . , € ¥ p ne , 
such that the closures of the sets 

T £ /2 ■= {(i£ lA ( f )' • • • >*W f ))l» > n °' f : ¥ p ^ F P' IMOLtAO > 

and 

Se/2 ■= {(*A A (f), • • • ,*Wf))l« > n ,f : F"^ F p , ||e p (f)|| w(Dn) < 25} 

are disjoint. For a distributional function r : ¥ p — > P(F p ) define z(T) := (t^ ^(r), ^(r)). 
Let c > be the distance between the closures of T £ / 2 and S E / 2 , and set e' = c/4. We will 
show that for large enough n > no, if T : ¥ p — >• P(F P ) has ||e p (r)|| u m n ) > e then z(r) is within 
distance e' from T £ / 2 ; and if ||e p (r)|| u m ) < <5 then z(r) is within distance e' from S e /2', this 
will conclude the lemma. 

Consider first the case where ||e p (r)|| u m n -) > e. Let F ~ T. By Claims [6731 and [6~41 we have that 
there exists no, such that for every n > no we have 

Pr F ^ r [||e p (F)|| u{Dn) > e/2] > 0.99, 

and that for every 1 < i < t we have 

Pr F ~r[|t;U(F) " < A > 1 " 

By the union bound, there exists a specific f : ¥ p — > ¥ p such that both conditions hold. That is, 
z(f) € S e /2 and ||z(r) — 2(f)||oo < e'. The case where ||e p (r)|| u (£> n ) < 5 is completely analogous. □ 

We thus study from now on distributional functions r : ¥ p — > P(¥ p ). The next step is to define 
averages of such functions with regards to polynomial factors. Let B be a polynomial factor. We 
define the average E(r|£>) : F™ — > P(¥ p ) as follows. Assume B defines a partition CiU . . . UC& of 
F^. For x € Cj define E(r|i3)(x) to be the average of T(y) over y € Cj, 

E(T|B)(x) = 4r E W 

Note that a c o E(T[B) = E(o c o r|S). 
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We say a polynomial factor B is (d, £)-good for T if, informally, U d+1 norms cannot distinguish 
between T and E(T|Z5). Formally, we say a polynomial factor B is (d, 5)-good for T if for all c G F p 
we have 

\\a c oT - a c oE(T\B)\\ ud +i < 5. (34) 
We first argue that (d, <5)-good polynomial factors exist for every distributional function. 

Claim 6.6. Let 5 > 0, d < p and r(-) be an arbitrary growth function. Then for every distributional 
function T : F™ — >■ P(F p ) there exists a (d, 5) -good polynomial factor B with degree d, complexity 
C < C max (p, d, 8, r(-)) and ranA; oi least r(C). 

Proof. Apply Theorem 14.81 on the set of functions {a c o T : c G F p }. □ 

The next claim is similar to Theorem 13.51 It shows that if T is a distributional function, and if 
B is a (d, £)-good polynomial factor for T where 5 > is small enough, then averages t*£ g cannot 
distinguish between T and E(T|B) if the true complexity of C is at most d. 

Claim 6.7. Let C = {L±, . . . , L m } be a linear system of true complexity d and Cauchy-Schwarz 
complexity at most p. For every e > 0, there exists 5 > such that the following holds. Let 
r : F™ — > -P(Fp) be a distributional function and let B be a (d,5)-good polynomial factor for T. 
Then for every f3 G F™, 

\tlp(T)-tlp(E(T\B))\<e. 
Proof. Let /j := ag(j) o T for 1 < i < m. We have 



t*c,p(T) — ^xe(F£) fe 
and since ag^ oE(T|£>) = K(fi\B), we also have 

tl 8 (E(T\B)) = E Xe(F „ )fe 



.i=i 



flEUimuix)) 



The claim follows by Theorem 13.51 and (|34p . □ 

We would also require an analog of Proposition 14.91 Let T : F™ — > -P(F p ) be a distributional 
function, and let B be a (d, <5)-good polynomial factor for T defined by polynomials Pi,... ,Pc- 
Let B' be another polynomial factor defined by polynomials Q%, . . . , Qc- We define a new hybrid 
distribution, denoted E(T|£> — > B') as follows: assume E(T|£>)(x) = F(P\(x), . . . , Pc(x)) where 
F : Fp — > P(F p ) is some function; we define 

F(T\B := F(Qi(x), . . . , Q c {x)). 

Lemma 6.8. Let C be a homogeneous system of m linear forms. Let d > 1 be a degree bound, 
and e > a required error. There exists r m ; n = r m ; n (m, d, e) such that the following holds. Let 
B, B' be polynomial factors of degree at most d defined by P±, . . . , Pq and Q±, . . . , Qc, respectively. 
Assume that deg(Pj) = deg(Qi) for all 1 < i < C and rank(£>), rank(£>') > r m \ n . Then for every 
distributional function V : F^ — > P(F p ) and any f3 G F™ we have 

\tl 8 (E(T\B))-tl 8 (E(T\B^B'))\<e. 

The proof is identical to the proof of Proposition 14.91 and we do not repeat it. 
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6.3 Proof of Theorem [TBI 



The proof follows very similar lines to the proof of Theorem l5.2i We will highlight the changes that 
need to be made in the proof, and avoid repetition wherever possible. 

Let T> = {-D n }neN be a proper dual family where D n is a family of functions from F" — > ¥ p . By 
Lemma 16.51 we get that for every e > there exists 5 € (0, e), no € N and systems of homogeneous 
linear forms Ci,...,Ct which have Cauchy-Schwarz complexity < q — 2, along with coefficients 
Pl,...,/3i, such such that the closure of the following two sets are disjoint: 

Te ■= {(*A A (r), • • • ,t^ A (r))|n > n , r : F™ — > P(¥ p ), ||e p (r)|| u(Dn) > e}, 

and 

S £ := m ufil (T), . . .,t* CeA (T))\n > n ,T : F^ P(¥ p ), ||e p (r)|| u(Dn) < 5}. 

Let s<q — 2<pbea bound on the Cauchy-Schwarz complexity of C\, . . . ,£t and d < s be a 
bound on their true complexity. We define S in the same way as in the proof of Theorem 15.21 

Claim 6.9. Unless all functions in T> are constants, we have 1 € S. 

The proof is identical to the proof of Claim 15.31 The case where T> consists of only constant 
functions is analyzed in the same way, hence we assume that 1 £ 5 from now on. 

Claim 6.10. S C {l,...,s}. 

Proof. The proof is identical to the proof of Claim 15.41 The only difference is the argument why 
0^ is in the closure of S £ . Let T map every element of F^ to the uniform probability distribution 
over F p . Then it is easy to verify that (^^(r)! = 0{p~ n ) for all □ 

We denote t := max(S). Let (T n : F£ P(¥ p )) neN be a sequence distributional functions 
where linin-^oo E[e p (r n )] = 0. Similar to the proof of Theorem I5.2( the proof of Theorem 11.31 follows 
from the following two lemmas. 

Lemma 6.11. // lim^oo \\e p (T n ) \\ ut +i = then lim^oo ||r n || u ( Dn ) = 0. (Should be 

\K(Pn)\\u(D„) = 0) 

Lemma 6.12. 7/lim^oo ||/ n |L(£> n ) = then lim^oo ||/n||c/ t + 1 = 0. 

The proof of both lemmas is identical to the proof of Lemmas 15.51 and 15.61 where the only 
difference is that one considers averages t* c . «-(r„) instead of tc uai (f n ) and apply the claims proved 
in the previous subsection for function r : F^ — > P(¥ p ) instead of their analogs for functions 
/ : F" — >• B. The only lemma whose proof needs to be slightly changed is Lemma 15.141 We 
sketch below an analog version for distributional functions. First, note that for every function 
F : F p — y [0, 1] there exists a distributional function Fp : ¥ p — >• P(¥ p ) such that a c oTp = F for 
all c G F p \ {0}: simply set PrT>(x) = 0] = F(x) + (1 - F(x))± and Pt[T f (x) = z] = (1 - 
for z € Fp \ {0}. Moreover, this implies that for every system of m linear forms C and coefficients 
/3e(Fp\{0}r, 

t*c^T F ) = tc(F). 

The lemma now follows, when in the proof of Lemma 15.141 one replaces F with Tp, g n with Q n (a 
deterministic distributional function) and h n with Th n . 
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7 Non-isomorphic connected systems have nonempty interior 



We prove in this section the following theorem. 

Theorem 7.1. Let £\, . . . , £k be non-isomorphic connected systems of linear forms. For sufficiently 
large n G N, the set of points 

{ (t Cl (/),..., t Ch (/)) | / : F£ -> [0, l]}C! fc (35) 

/tas a non-empty interior. 

We divided this section into two parts. In Section [7.11 we introduce new notations and develop 
some preliminary tools. The proof of Theorem 17.11 is given in Section [7.21 



7.1 Flagged systems of linear forms 

Consider a system of linear forms £. Recall that £ is connected if there does not exists nonempty 
5C£ such that span(S') D span(£ \S) = {0}. Suppose that there are subsets Si, S2 C £ such that 

span(Sj) n span(£ \ Si) = {0}, 

for i = 1,2. Then for T = Si fl S2, we have 

span(T) n span(£ \ T) = {0}. 

This in particular shows that (up to the isomorphisms) there is a unique way to partition a system 
of linear forms £ into disjoint connected systems of linear forms £\, ■ ■ ■ ,£/.. We call each one of 
£\,...,£k a connected component of £. Since connectivity is invariant under isomorphisms we 
have the following trivial observation. 

Observation 7.2. Two systems of linear forms £\ and £2 are isomorphic if and only if there is a 
one to one isomorphic correspondence between their connected components. 



A 1-flagged system of linear forms is a system of linear forms £ and a non-zero linear form 
M € span(£). We use the notation £ M to denote such a 1-flagged system of linear forms. Here £ is 
called the underlying system of linear forms of £ M 
invertible linear transformation T : span(£o) ~~ ^ span(£i 
to £q induces an isomorphism between £q and £\. 

Let £ be a system of linear forms in k variables. For a 1-flagged system of linear forms £ M 



f°and rf 1 
that maps Mq to M\ and its restriction 



We call £q 10 and isomorphic, if there is an 



and a function / : F" — > C define the function / 



Cby 



f c :x^E Xe 



(*5) 



n wx)) 



M(X) 



Note that we have 



i £ (/)=E™ f c (X) 



(36) 



(37) 



Let £^f° and be 1-flagged systems of linear forms in F^ and F^ 1 , respectively. We want 
to define an operation that "glues" these two systems to each other by identifying Mq and M\\ To 
this end, first we consider the system of linear forms £' defined as 



£' = {L 1 



G ¥* 0+kl : L G £ } U {0 L G F^ 0+fcl : L G £1}. 
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Take any element M 7^ in Fp 0+fcl , and any surjective linear transformation T : Fp 0+fcl — >• Fp 0+fcl 1 
that maps both M and Mi to M. Then the product of £g fo and £f fl which is denoted by 
£M _ £Mi - g defined as the i-flagged system of linear forms 

£^o. £ Mi := (T(C')) M . 

Note that since T is surjective, this definition does not depend (up to isomorphism) on the particular 
choices of the linear form M and the map T. The restrictions of T to each one of the sets 

span{L © G F^ 0+fcl : L G C }, 

and 

span{0©L G W* 0+kl : L G A} 

is invertible, and thus T induces isomorphisms between these sets and their corresponding d (i = 
or 1). Therefore we shall refer to {T(L © 0) G F£°+ &1 : L G £ } M ° and {T(0 © L) G F^ 0+fcl : L G 
£ 1 |Mi reS p ec tively as copies of C^f° and £;| Wl in C^° -C^ 1 . In the sequel, we will frequently identify 
1-flagged system of linear forms with their copies in their product. 

The definition of the product of 1-flagged systems of linear forms is motivated by the following 
fact: It follows from (j36j) that if C^° and C^ 1 are 1-flagged systems of linear forms, then for every 
function / : F™ — > C, we have 

f£o -A = f c o f c i . (38) 

Lemma 7.3. Let C^f° and C^ 1 be 1-flagged systems of linear forms such that both Co U {Mo} and 
C\ U {Mi} are connected. Then £ U {M} is also connected, where C M := C^f° ■ C^ 1 . 

Proof. Suppose that £ and £ x 1 are respectively in k$ and k\ variables. Let T : Fp 0+fcl — > 
jpfco+fci— 1 as [ n the definition of the product of two 1-flagged systems of linear forms given above. 
Consider a nonempty set S C £ U {M}. Suppose to the contrary of the assertion that 

span(S) n span((£ U {M}) \S) = {0}. (39) 

We identify C^ and £f 1 with their copies in C . In particular, both Mq and Mi are identified 
with M. Since Co U {M} and C\ U {M} are both connected we have M G span(£o \ {M}) and 
M G span(£i \ {M}). Then it follows from (|3SJ) that we have S/£«U {M}, for i = 1,2. Also by 
(j39|) we have 

span(S n (£ U {M})) n span((£ U {M}) \ S) = {0}, 

and 

span(S n (£1 U {M })) n span((£i U {M}) \ 5) = {0}. 
Thus at least one of £0 U {M} or £1 U {A/} is not connected which contradicts our assumption. □ 

For a system of linear forms £ and an L G span(£) define deg £ (L) to be the number of pairs 
(x,y) G £ x £ satisfying x + y = L. 

Let £ M := £q^ • £^ where £^ and Cf 1 are 1-flagged systems of linear forms. It follows from 
the definition of the product that if x + y G span({M}) with x G Co and y G £1, then both x,y 
belong to span({M}). Hence for every L G span({M}), we have 

deg £o (L)+deg £l (L) < deg £ (L) < deg A) (L)+deg A (L) + |span({M})n£ | + |span({M})n£i|, (40) 
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and similarly for L £ Ci\ span({Af}) where i = 0, 1, we have 

deg £i (L) < deg £ (L) < deg £i (L) + 2|span({M}) n (41) 

Lemma 7.4. Xei £f \ . . . , C^ k be non-isomorphic 1- flagged systems of linear forms such that 
Li U {Mi} are connected for all i 6 [k]. For every N > 0, there exist a 1-flagged system of linear 
forms C M such that rank(span(£)) > N, and the underlying systems of linear forms of C M ■ Cf^ 1 
for i £ [k] are connected and non-isomorphic. 

Proof Let d > p + N be larger than the size of £j, for every i £ [k]. Denote e.\ = (1, 0, . . . , 0) £ F d 
and consider the system of linear forms 

M := ({0} x F^ 1 ) U ({1} x {0, l}^ 1 ) \ {0, Cl } C F d . 

We claim that M is connected. Indeed F^ -1 \ {0} = ({0} x F^ 1 ) \ {0} C M is trivially connected, 
and hence if S C A4 is such that span(S') n span(A4 \ S) = {0}, then without loss of generality we 
can assume that 

({0} x F^ 1 ) \ {0} C S. 

One can easily verify that S cannot be equal to ({0} x F^ _1 ) \ {0}. Hence there exists at least one 
element L £ Sn ({1} x {0, l}^ 1 ). Then M C span({L} U ({0} x F^ 1 )) C span(S') which together 
with the assumption span(5) n span(A4 \ S) = {0} shows that S = M. Hence Mi is connected. 

Note that 2 d ~ x < deg_ M (L) < 4p d_1 for every L £ Mi, and deg M (e\) = 2{2 d 1 — 1). Furthermore 
for every A £ F p \ {0, 1}, we have deg^Aei) = 0. Also we have span({ei}) n Ml = 0. Set 

Z M := M ei • . . . • M ei . 

V v ' 

10p d times 

and 

C M := (£U{M}) M 

By (|40p and (|4ip . and the above properties of Mi, we have 2 d ~ 1 < deg^(L) < Ap d ~ l , for every 
L £ C \ {M}. Moreover deg £ (M) > Wp d , and deg £ (L) = for every L £ span({M}) \ {0, M}. It 
also follows from span({ei}) n M = that span({M}) n £ = {M}. 

For every »<[*;], set A^ 1 := C M ■ C,f\ Then by (00]) and flU]), we have 

(i) deg M (P^) > 10p d ; 

(ii) 2 d ~ l < deg M (L) < Ap^ 1 + 2p< bp^ 1 for every L € A/"; \ (& U {WJ); 

(iii) deg M (L) < |A| < 2 d - x for every L £ A \ {Wi}. 

Since A4 is connected and ei £ span(A^), we have that Mi U {ei} is also connected. Then 
Lemma 17.31 shows that C is connected. Now since Ci are connected, Lemma 17.31 implies that 
Mi = Mi U {Wj} are connected. It remains to show that they are non-isomorphic. But if Mi is 
isomorphic to Mj for some i,j £ [k], then there is a bijection between Mi and A/} that can be 
extended to an invertible T : span(Ai) — > span(A/}). Since such a function, maps to 0, and 
preserves the degrees, by (i), (ii), and (iii) above, we have T(Wi) = Wj and {T(L) : L £ £j} = Cj. 
Thus the restriction of T to £^ is an isomorphism between Cf 1 * and Cj j contradicting our 
assumption that Cf 1 * and Cj Ij are non-isomorphic. □ 
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7.2 Finishing the proof. 

We view averages tc(f) as polynomials in the variables {/ (x) : x G F™}, 

^ m 

xe(F") fc i=l 

We start by proving a technical lemma. 

Lemma 7.5. Lei Ci, . . . , 6e non-isomorphic connected systems of linear forms. Let P±, . . . , P/. 
be functions mapping every f : F™ — > C to C m £/ie following way. Every Pi is a polynomial of 
degree at most d in variables {f{x) : x G F™}. If n > d + max ie [ fc ] rank(span(£j)), and for every 
i G [A;], rank(span(£j)) > d, ifoen 

unless Pi = /or aZZ i G [fe] . 

Proof. We claim a stronger statement that if at least one of Pj is not divisible by the variable /(0), 
then 

Pi{f)tcAf) + ■■■ + Pkif)tc k {f)\ m=0 # o. 

Trivially it suffices to prove this statement for the case where for every i £ [fc], no monomial of Pj 
is divisible by /(0). Assume to the contrary that there exist polynomials Pi of degree at most d in 
variables {/(x) : x G F™} with monomials which are not divisible by /(0) such that 

Pi(f)tcAf) + ■■■ + W)tc k (f)\ m=0 = o. 

Without loss of generality assume that for some positive integer I, every Ci is a system of linear 
forms in FL Define the rank of a monomial n^eF™ f( x ) ax to be the rank of span({x : a x ^ 0}). 
Let rj denote the largest rank of a monomial with a non-zero coefficient in Pj. 

Set iq := argmaxj g [ fc ] (rj + rank(span(£j))). Let non-zero x±, . . . , x a € F™ be so that 

rank(span({xi, . . . ,x a })) = r io , 

and Y\i=i f( x i) ai where on > appears with a non-zero coefficient in Pj . Since n > d + 
rank(span(/3j )), there exists x G (Fp) fc such that {L(x) : L G Ci } are all distinct and 
span({L(x) : L G A }) n span({xi, . . . , x a }) = {0}. Note that the connectivity of £j , and the 
assumption that deg(Pj ) < rank (span (A )) implies that the monomial 

\i=i J Lec iQ 

appears with a non-zero coefficient in Pi (f)tc io (f) (i-e. there is no cancelation). 

Suppose that this monomial appears with a non-zero coefficient also for some other 1 < j < k in 
Pj(f) YiieCj /(M x '))> where x' G (Fp) fc . Then the maximality of rj +rank(span(£j )), connectivity 
of Cj, and the assumption that deg(Pj) < rank(span(£j)) shows that {L(x') : L G Cj} = {L(x) : 
L G Ci } as multisets. By the assumption that {L(x) : L G Ci } are all distinct we get that {X(x') : 
L G Cj} are also all distinct. It follows that Cj is isomorphic to Ci , which is a contradiction. □ 
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Consider a system of linear forms C and a function / : F™ — > C. Define the function f' dl ~ : F™ — > 
C, as 

The following easy lemma which follows from linearity of expectation explains the motivation for 
this notation. 

Lemma 7.6. For f,g : F™ — > C, and every system of linear forms C, we have 

j t tc(f + tg)\ t=0 = E[g(X)f dc (X) 

where X is a random variable taking values in F™ uniformly at random. 

Consider connected non-isomorphic systems of linear forms C\, . . . , L^. We claim that in order 
to prove Theorem 17. II it suffices to shows that there exists / : F™ — > (0, 1) such that f 9 ^ 1 , . . . , f 9£,k 
are linearly independent over R. 

Claim 7.7. Let f : F™ — > (0, 1) be such that f® Cl , . . . , f dCk are linearly independent over R. Then 
there exists e > such that 

{(t Cl (f), ■ ■ -,tc k (f)) + z:z£R k , \\zWoo < e] C {(t A (g), . . . ,t A ($)) : 9 : -)• (0, 1)} . 

Proof. Let ei, . . . , 6R* denote the unit vectors. Since Z^ 1 , . . . , f dl ~ k are linearly independent 
over R, for every there exists gi : F™ — > R such that 



FxeF" 



where <5jj = l\%=j] is the Kronecker delta function. For z € R fc define ^(x) = X^i=i z i9i( x )i an d 
consider the map T : R fc -»• M fc defined as 

T(z 1 ,...,z k ) = (t Cl (f + g z ), ■ ■ ■ ,tc k (f + 9z))- 

The Jacobian of T is the identity matrix, and hence invertible. By the inverse function theorem, 
for every i] > 0, {T(z) : ||z||oo < v} contains a neighborhood of T(0) = (t^if), ■ ■ ■ , tc k (f))- We 
will choose r/ > small enough such that ||/ + g z \\oo < 1 for all ||-z||qo < V- ^ 

Suppose to the contrary that for every / : F™ — > (0, 1), f dCl , . ■ ■ , f k are linearly dependent 
over R. Note that for every 1 < i < k, and for every x$ € F™, f dCi {xo) is a polynomial of degree 

— 1 in the variables {f(x) : x £ F™}. The linear dependency of f 9 ** 1 , . . . , /^ fc shows that for 
every / : F™ — > (0, 1), the k x k matrix whose ij-th entry is E is singular which 

in turn implies that the determinant of this matrix as a polynomial in {/(x) : x £ F™} is the zero 
polynomial. So the functions considered as vectors with polynomial entries are dependent over 
the field of fractions of polynomials in variables {f{x) : x € F™}. Furthermore since the degree of 
the ij-th entry of this matrix is at most + \Cj \ — 2 which does not depend on n, we conclude 
that there exist polynomials Pi,...,Pk in the variables {/(x) : x € F™}, and of degree at most 
some integer C := C(|£i|, . . . , \Ck\) (which does not depend on n) such that 

Pl(f)f 9Cl + ... + Pk(f)f dCk =0. (42) 
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Let Mi\...,Mf l be some representatives for all the isomorphism classes of 1-flagged systems of 
linear forms {(Ci \ {L}) L : i G [k],L G Ci}. Let {ctij G Z + : i G [fe], j G [/]} be such that for every 
i G [k], we have 

3=1 

By Lemma 17.41 it is possible to find a 1-flagged system of linear form C M of arbitrarily large rank 

such that for '■= A4^ j ■ C M (where j G [I]), the systems of linear forms A/} are non-isomorphic 

k[ w 3 M ^ r M 

and connected. Note that by (J38J) we have j i = f i f , and so we have 



=i je[J] 



which by (|37p implies that 



^ Cx;«<^(/))^(/)=o- 

7'efil Vi=i / 



, — , (43) 

By Lemma 1731 this shows that Y2i=i a i,jPi(f) = for every j G [/]. On the other hand since the Ci 
are non-isomorphic, (Ci \ {L}) L ^ (Cj \ {M}) M for every two distinct i, j G [/] and every L G Li 
and M G Cj. Hence for every j G [I], there is exactly one i G [fc] such that on j ^ 0. Also trivially 
for every i G [k], there exists at least one j G [/] with onj / 0. It follows that Pi = for every 
i G [fc] which is a contradiction. 



8 Concluding remarks 

In this paper we study affine invariant properties which are testable. We show that essentially every 
such property can be tested by an appropriate Gowers uniformity norm. One technical limitation 
of our techniques is that they hold only if the field size is not too small (i.e. if the Cauchy-Schwarz 
complexity is smaller than the field size). The main reason for this obstacle is that the main 
technical tools developed by the authors in [11] are limited to large fields. A recent result of Tao 
and Ziegler [16] extend the inverse Gowers theorem to small characteristics, and it is plausible that 
combining the techniques would allow to extend our results to all fields. We leave this for future 
work. 

Even if all the required technical tools were established, it would still not answer the following 
problem: is it possible to test, using a constant number of queries, whether a function / : — > ¥ p 
is correlated to a polynomial of degree d, where d > p? We know that the Gowers norm test fails, 
as it actually tests distance to a larger set of functions (non-classical polynomials). The simplest 
case which is unknown is the following: 

Problem 8.1. Let f : F% — > ¥2- Does there exist a test which queries f on a constant number of 
positions, and which can distinguish whether f has noticeable or negligible correlation with cubic 
polynomials? 

In fact, even the following simpler problem is unknown: 
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Problem 8.2. Let e > and f : — > F2. -Does i/iere exist a test which queries f on q(e) 
positions, and which can distinguish whether f has correlation at least e, or at most 5(e), with 
cubic polynomials? 
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